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Overview and conclusions 



Towards a cosmological standard model 

The study of cosmic microwave background anisotropies is one of the pillars of modern 
cosmology. The cosmic microwave background (hereafter CMB) consists of photons left over 
by the hot phase after the Big-Bang and is very homogeneous and isotropic. Its existence 
was predicted by Gamov (1946), and accidentally discovered only much later by Penzias and 
Wilson (Penzias & Wilson, 1965), but it was only in 1992 that the COBE satellite (Smoot 
et a!., 1992) detected the presence of tiny temperature fluctuations (1 part in lOO'OOO), which 
are thought to have been generated by quantum fluctuations in the very early universe. The 
observational study of these temperature fluctuations, known as anisotropies, has been a great 
technological achievement. Over the last ten years, there has been a spectacular advancement 
in the accuracy of measurements, using ground-based, balloon-born and orbital instruments. 
The WMAP satellite (Bennett et al., 2003) has recently measured the anisotropies with a 
precision which, on certain scales, is close to a fundamental statistical limit, called "cosmic 
variance". 

The importance of such a wealth of data for theoretical cosmology cannot be overstated. 
In a few seconds on a desktop computer, it is nowadays possible to produce accurate numerical 
predictions of the statistical distribution of the anisotropies on the sky for any cosmological 
model of interest, i.e. of the CMB angular power spectrum. If the primordial fluctuations are 
Gaussian distributed, then the power spectrum encodes all of the statistical information: its 
computation is based on linear perturbation theory and the underlying physics is well under- 
stood. The detailed shape of the power spectrum carries characteristic signatures depending 
on the value of the late Universe cosmological parameters and on the initial conditions for 
the perturbations. By "late Universe cosmological parameters" we mean the quantities con- 
trolling the expansion history of the Universe, i.e. its matter budget, complemented by some 
description of the reionization history. In the former category, an incomplete list would 
include the Hubble parameter, the energy density in baryons, cold dark matter and dark 
energy, the dark energy equation of state parameter (possibly including a description of its 
time evolution), the neutrino masses and the number of massless families plus the density 
parameters and effective equation of state of any other exotic form of matter one might wish 
to include; specifying how the Universe was reionized in the context of stellar evolution the- 
ory might require three or four additional parameters, which however usually reduce to the 
optical depth to reionization or equivalently to the redshift of reionization, as far as the CMB 
is concerned. Specifying the initial conditions requires the value of "primordial parameters" 
for the amplitudes of the primordial fluctuations in each of the matter components and their 
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scale dependence. 

The fact that CMB anisotropies are sensitive both to the late Universe cosmological pa- 
rameters and to primordial parameters means that CMB observations only constrain a (de- 
generate) combination of both: until now, disentangling the former required rather strong 
assumptions about the nature of initial conditions. Some guidance is offered by the inflation- 
ary paradigm: in its simplest incarnation, the decay of the inflaton field produces adiabatic 
initial conditions, in which there is no fluctuation in the relative number density of the 
species, hence no entropy perturbations ("adiabatic"). The presence of entropy fluctuations 
can excite up to four other non-decaying modes for the perturbations. Those are collectively 
termed "isocurvature", because in three cases the total matter density is unperturbed and 
hence there is no curvature perturbation in the spatial sections either. The observation of 
the first acoustic peak in the CMB power spectrum (Page et al., 2003) at i = 220.1 ±0.8 has 
substantially confirmed the predominance of the adiabatic mode. However, a subdominant 
isocurvature contribution to the prevalent adiabatic mode cannot be excluded: after all, there 
is no compelling reason why the physics of the early universe should boil down to only one 
degree of freedom. 

Even though in principle the number of late Universe parameters can be very large, easily 
exceeding a dozen, only an handful of them seems to be required by the currently available 
observational evidence (Spergel et al., 2003; Tegmark et al., 2004b; Liddle, 2004): 

• the Hubble parameter h ~ 0.7; 

• the density parameter for baryons ^ ~ 0.05; 

• the density parameter for cold dark matter (CDM) O cc j m ~ 0.25; 

• the density parameter for a cosmological constant J7a ~ 0.7; 

• the optical depth to reionization r re ~ 0.15. 

Summed together, Q C( j m + ^fe + ^A ~ 1 imply a flat Universe. The crucial point is that for the 
CMB these results only hold once we make the rather strong assumption of purely adiabatic 
initial conditions. In that case, the primordial parameters reduce to the spectral index for the 
fluctuations, n s ~ 1, and an overall adiabatic amplitude ^4ad- These two quantities together 
with the above five late Universe parameters are what we call "standard CMB parameters", 
because they build the basis of the "concordance model" of present-day cosmology 1 . 

By combining CMB data with other cosmological and astrophysical measurements - such 
as galaxy distribution statistics, supernova? luminosity distance measurements, gravitational 
lensing statistics, Lyman a absorption lines, local determination of the Hubble parameter, 
light elements abundance - we have reached an unprecedented precision in determining the 
standard cosmological parameters, which are now known with an accuracy of a few percent. 
This is even more astonishing if we think that only ten years ago it was only possible for most 
parameters to estimate their order of magnitude. Most importantly, various independent 

1 We do not discuss the possibility of gravitational waves, which are indeed predicted by any inflationary 
scenario; presently there are merely upper limits to their contribution, which could be small enough to be 
very difficult to detect in the CMB. Our discussion here and in the following focuses on the scalar sector 
only. 
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observations - which probe very different epochs of the cosmic history and are based on 
totally different physical processes - seem to be converging to the same answer. 

We are now in a position where we can move on from parameter fitting to model testing: 
in other words, in order to establish a "cosmological standard model" we need to assess the 
consistency and completeness of our theoretical framework. In order to be sure that we can 
trust the error-bars on the standard parameters beyond the quoted statistical error, we have 
to confront ourselves with the question of possible systematic errors in the measurements on 
one side, and of hidden flaws in our theoretical interpretation of the data on the other. Given 
the intrinsic difficulty of many cosmological observations, an assessment of systematic errors 
for a certain data-set can come from the combination with other, independent measurements 
of the same quantity. Discrepancies in the results will indicate a flaw in the underlying theory, 
or in the data, or in both. This is one of the reasons why the comparison of many data-sets 
is so important, the other being that often the combined data have a superior constraining 
power due to the breaking of degenerate directions in parameter space. Prom the point of view 
of model-building, it is now becoming possible to relax some assumptions which were before 
necessary in order to extract from the data any information at all, and thereby check whether 
our results are robust or else whether they critically depend on our prejudices. If it is found 
that our conclusions depend strongly on the underlying model assumptions, then we need 
to critically review our theoretical paradigm and open our mind to alternative explicative 
models. 



Testing the concordance model with the CMB 

The CMB is an excellent testing ground to carry out this program: our theoretical under- 
standing is based on General Relativity and linear perturbation theory, which suffices to 
describe almost all of the relevant physical processes. This makes us confident that we un- 
derstand quite well CMB anisotropics, and we can exploit them to go beyond the standard 
cosmological parameters in two different ways: the first path leads directly to the primordial 
Universe, via the dependence of the CMB on the nature of initial conditions; the second 
approach makes use of the high quality of recent CMB data to look for effects which were 
previously ignored because thought to be irrelevant, but which are now within the constrain- 
ing power of the observations. In both cases, the microwave background plays the role of a 
Universe-sized laboratory for the study of fundamental physics which is often unaccessible 
to any particle physics laboratory. This work pursues both those aspects, as we detail in the 
following. 

In the first part, we introduce in Chapter 1 the homogeneous and isotropic Friedmann- 
Robertson- Walker universe, which is the background on which perturbation theory is built, 
and we briefly present a few other observations which we later compare and combine with 
the CMB. We then give the derivation of all the relevant perturbation equations needed to 
describe the CMB in Chapter 2. Those are applied to the temperature fluctuations in the 
cosmic photons in the second part: in Chapter 3 we obtain under various approximations 
analytical expressions for the growth of perturbations in an Universe containing photons, 
cold dark matter, massless neutrinos, baryons and a cosmological constant; in Chapter 4 we 
present a thorough account of the main features of the CMB temperature and polarization 
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angular power spectra. In particular, we are concerned with characteristic signatures on the 
angular power spectra of the standard cosmological parameters, which constitute the basis 
for their determination using CMB data. We also introduce the most general type of initial 
conditions, which consist of one adiabatic and four isocurvature modes. The third part fo- 
cuses on the interplay between theoretical modelling and observational data. The comparison 
of theoretical models with actual data needs some basis in probability theory and statistics, 
which we give in Chapter 5, emphasizing their application to the problem of parameter esti- 
mation from CMB observations. The last two chapters contain most of the original research 
work, which is developed along the two lines sketched above: Chapter 6 deals with the obser- 
vational consequences and constraints when we add to the standard cosmological parameters 
new quantities describing possible departures from known physics, while Chapter 7 explores 
the consequences of relaxing the fundamental assumption of adiabaticity. 

In § 6.1 we focus on the effective number of massless neutrino families, N c g (Bowen 
et al., 2002). Although in the standard model of particle physics N e g = 3, there are several 
mechanism which would give N e s / 3 as measured by the two cosmological probes we discuss, 
namely Big-Bang Nucleosynthesis (BBN) combined with observations of the light elements 
abundances, and CMB. This is because both of them are sensitive not only to the number 
of weakly interacting neutrinos, but rather to the total energy density of relativistic particles 
which sets the expansion rate at early times, and therefore can constrain e.g. the existence of 
sterile neutrinos unobservable in Z-decay experiments. Using pre- WMAP CMB data alone, 
we obtain fairly broad bounds on Ngg, 0.04 < N e g < 13.37 with 2a likelihood content, 
which are reduced by including prior information coming from supernovas luminosity distance 
measurements and large scale structure observations. We show that N e ff, or equivalently 
uj ie \ = f2 re i/i 2 , the energy density parameter in relativistic particles, is nearly degenerate 
with the amount of energy in matter, uj m = f2 m /i 2 , and that its inclusion in CMB parameter 
estimation also affects the constraints on other parameters such as the curvature or the scalar 
spectral index of primordial fluctuations. However, even though this degeneracy has the effect 
of limiting the accuracy of parameter estimation from the WMAP satellite, we find that it can 
be broken by measurements on smaller scales such as those provided by the Planck satellite 
mission. We forecast that Planck will be able to constrain A e fj within 0.24 (lcr). 

The primordial 4 He mass fraction, Y p , is predicted by BBN along with the abundances of 
the other light elements as a function of two free parameters, namely the baryon density 
and the relativistic energy density ui re \. If we fix A e fj = 3 and thereby uj re \ as motivated by the 
particle physics standard model, then in standard BBN the abundances of D, 3 He, 4 He and 
7 Li depend on the baryon density alone: comparison with the observed values in astrophysical 
systems indicates a slight discrepancy, which however presently cannot clearly be ascribed to 
systematical errors or to deviations from the standard BBN scenario. We explore in § 6.2 the 
potentiality of using the CMB as a totally independent way of measuring Y p via its impact 
on the reionization history, thereby possibly allowing to discriminate between the various hy- 
pothesis (Trotta <fc Hansen, 2004). We find that WMAP data give only a marginal detection, 
0.160 < Y p < 0.501 at 68% likelihood content. We estimate that the Planck satellite will 
determine the helium mass fraction within 5% (or AY p ~ 0.01), which however will only allow 
a marginal discrimination between different astrophysical measurements. Equally important, 
we identify degeneracies between Y p and other cosmological parameters, most notably the 
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baryon abundance, the redshift and optical depth of reionization and the spectral index; we 
conclude that even though present-day CMB data accuracy does not require the inclusion 
of Y p as a free parameter, the uncertainty of the helium fraction will have to be taken into 
account in order to correctly estimate the errors on the baryon density from Planck. 

The search for observational evidence for time or space variations of the "fundamental" 
constants that can be measured in our four-dimensional world is an extremely exciting area of 
current research, with several independent claims of detections in different contexts emerging 
in the last few years, together with other improved constraints. Most efforts have been 
concentrating on the fine-structure constant, a, both due to its obviously fundamental role 
and to the availability of a series of independent methods of measurement. Of particular 
interest is the result of Webb and collaborators, who claim a 4cr detection of a fine-structure 
constant that was smaller in the past (Murphy et al., 2003; Webb et al., 2003). Noteworthy 
among the possibilities of independently check those results is the CMB, which probes ad eC) 
the value of a at decoupling, z ~ 1100 (Martins et al., 2002, 2004; Rocha et al, 2004). As we 
show in § 6.3, by analyzing the first year WMAP data for time-variations of a we obtain the 
constrain 0.95 < a^ ec /ao < 1.02 with 95% likelihood content, where ao denotes the present 
value. We clarify the issue of degeneracies between a and other standard parameters, and 
give exhaustive forecasts of the expected performance of the full four year WMAP data, of 
the Planck satellite and of an ideal CMB experiment. We emphasize the role of polarization 
measurements to lift flat directions (i.e., degeneracies) in parameter space, and discuss the 
role of reionization in the determination of «dec- 

In Chapter 7 we relax the assumption of adiabaticity by allowing for the most general 
initial conditions (Bucher et al., 2000) and we investigate two complementary aspects: the first 
is the degradation in the accuracy of the late Universe standard parameters as a consequence 
of the introduction of new degrees of freedom in the primordial Universe (Trotta et al., 
2001); the second is the robustness of the measurement of a non-zero cosmological constant, 
Oa 7^ 0, when different statistical approaches (frequentist rather then Bayesian) are applied 
to the data, or when general isocurvature modes are included in the analysis (Trotta et al., 
2003). We also explicitly test the paradigm of adiabaticity by using CMB observations to 
put constraints on the isocurvature contribution. 

For the first point, the results in § 7.2 demonstrate that the determination of the Hubble 
parameter and the baryon density from pre- WMAP CMB data is essentially impossible with- 
out strong assumptions about the nature of initial conditions. Conversely, it becomes very 
difficult to put limits on the type of the initial conditions without using external, non-CMB 
priors on the late Universe parameters. Indeed, the CMB is perhaps the most effective way 
to directly probe the very early Universe, and thereby constrain or falsify the models for the 
generation of perturbations. It is therefore very important to extract the most information 
about the conditions in the early Universe. Adding polarization information greatly enhances 
the power of the CMB to simultaneously constrain the late Universe parameters and the pri- 
mordial ones: we show in § 7.4 that the full four year WMAP data will measure orthogonal 
combinations of the late Universe parameters with an accuracy of the order 10% — 30% for 
most parameters even in the general initial conditions case. The Planck mission will have a 
better polarization resolution and will be able to do precision cosmology almost independently 
on the type of initial conditions (Trotta & Durrer, 2004). As for the possibility of mitigating 
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the cosmological constant problem by introducing isocurvature modes, our findings in § 7.3 
indicate that 7^ 0, as obtained from a combination of CMB and large scale structure data, 
is indeed robust even in the presence of isocurvature contributions. The more conservative 
frequentist statistics - as compared to the usual Bayesian approach - excludes = only 
at the 2a confidence level for pre-WMAP CMB data combined with the 2dF Galaxy Redshift 
Survey, but this only if we admit a rather low value for the Hubble constant, h ~ 0.5, which 
would be in contradiction with the result of the Hubble Space Telescope, h = 0.72 ± 0.08 
(Freedman et al., 2001). 

Outlook and conclusion 

The CMB has become a well established tool for the study of our Universe, and an unavoidable 
testing ground for any theoretical model. The ever improving quality of the data permits 
on one side to look for new physics in the early Universe, as shown in our study of time 
variations of a, on the presence of extra relativistic particles and on the existence of non- 
adiabatic modes; on the other hand, it also requires an upgrade of our modelling, so to 
properly treat subtle effects such as the uncertainty coming from our unprecise knowledge of 
the primordial Helium fraction, or from our ignorance on the correct model for the generation 
of fluctuations. For this reasons, it is important to look ahead, to the goals for the next 
generation of experiments, and to their potential to constrain or falsify the theoretical models. 

More than ever, the central issue is becoming how to efficiently and reliably extract 
the most information from upcoming high-quality data: there are about 2000 observable 
independent multipoles for each of the three angular power spectra, namely temperature, 
E-polarization and temperature-polarization cross-correlation, which however are highly re- 
dundant due to the smooth oscillatory nature of the spectra. The amount of information 
which can be extracted is much less, and can be condensed in maybe a dozen of well-chosen 
parameters. The best choice for those quantities is the one which takes into account the 
physics and selects orthogonal directions in parameters space on the basis of fundamental de- 
generacies. This idea has been a leitmotiv of the works presented here, and there is probably 
still space to apply it further, especially in connection with the primordial parameters. 

Despite this encouraging picture, there are still open challenges for our understanding of 
the Universe: the nature of dark energy and dark matter, the details of the initial conditions 
and the epoch of reionization, for example. The CMB will provide key advancements on 
all these issues over the next years. The polarization of the anisotropies has been detected 
by the experiments DASI (Kovac et al., 2002) and WMAP and will be precisely mapped by 
the forthcoming experiments PolarBear, Bicep, SPOrt, AMiBA and QUEST, opening up a 
new line of research and allowing to reconstruct the cosmological parameters with still higher 
precision. This process will culminate with the European Space Agency satellite Planck 
(Planck Website, 2004), which starting in 2007 will observe the temperature spectrum with 
the ultimate possible precision and provide accurate mapping of the polarization as well. In 
view of this wealth of data, and in order to fully exploit its potential, it is of fundamental 
importance that theoretical research on the subject advances accordingly. There is a need of 
more powerful and efficient computational and statistical techniques which can handle the 
considerably larger amount of data expected. Also, our theoretical understanding of model- 
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building has to be refined and in particular we need to further develop the interdisciplinary 
link between models coming from high energy physics, string theory, astrophysics and their 
observational signature on the CMB. This approach will strengthen the role of the CMB as a 
universe-size laboratory for investigating the most elusive domains of fundamental physics. 



All men, Socrates, who have any degree 
of right feeling, at the beginning of every 
enterprise, whether small or great, always 
call upon God. And we, too, who are go- 
ing to discourse of the nature of the uni- 
verse, how created or how existing with- 
out creation, if we be not altogether out of 
our wits, must invoke the aid of Gods and 
Goddesses and pray that our words may 
be acceptable to them and consistent with 
themselves. 

Plato 
Timaeus 



Part I 
BASICS 



Chapter 1 
Introduction 



1.1 Notation and conventions 

We begin by introducing the notation and conventions which are used throughout this work. 

• The metric signature is — h ++. 

• The spacetime metric is denoted by g^ v , where the spacetime coordinate are x^,p = 
0, 1, 2, 3. Greek indexes always run from to 3. 

• The 3-space of constant curvature has metric 7^ . Latin indexes always run from 1 to 
3. 

• When we discuss perturbations, the background, unperturbed quantities are denoted by 
an overline. Therefore for instance p = p + 5p, where p denotes the background energy 
density and p the perturbed (background plus linear perturbation) energy density. 

• The overdot " ' " denotes the derivative with respect to conformal time, rj. 

• Bold character denote the £ = 1,2,3 components of the corresponding 4- vector. 

• Unless otherwise stated we use natural units, in which the speed of light, the Boltzmann 
constant and the Planck constant are unity, c = ks = h = 1. 

• The Hubble parameter today is written as Hq = 100 h km s~ x Mpc -1 . 

• The symbol Q,x denotes the density parameter in the component X (where X can stand 
for baryons, photons, cold dark matter, etc.), expressed in units of the critical energy 
density. In general, £lx = £lx(v)i but whenever we omit the explicit time dependence, 
it is understood that the quantity is evaluated today, i.e. fix = &x(vo), where tjq is 
the present value of conformal time. 

• The critical energy density today is p CIlt {r)o) ~ 1-88 • 10~ 29 h? g/cm 3 , and the present 
energy density of component X is written px(vo) = 1-88 • 10~ 29 g/cm 3 , where we 
have defined ujx = ^x(vo)h 2 . 
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1.2 Friedmann- Robertson- Walker cosmology 

In this section, we briefly review the standard treatment of an homogeneous and isotropic 
universe. We present the background Einstein and conservation equations for perfect fluids, 
along with the unperturbed Boltzmann equation describing relativistic particles. 



1.2.1 Einstein equations 

The cosmic microwave background is homogeneous and isotropic to better than one part 
in lOO'OOO. This justifies the assumption that the universe, on large enough scale, can be 
treated as being homogeneous and isotropic. We then consider a 4-dimensional manifold 
A4 endowed with a metric g^ V) so that constant-time hypersurfaces are constant-curvature, 
maximally symmetric 3-spaces. The Friedmann-Robertson- Walker (FRW) metric reads 

g^dx^dx" = -dt 2 + a(t)j ij dx i dx j , (1.1) 

with the 3-space metric of curvature K. = {0, +1, —1} given by 

- /ij dx i dx j = dr 2 + x 2 {r){d9 2 + sin(fl) 2 dc/> 2 ) . (1.2) 

Here the scale factor a(t) depends only on time, and 



X{r) 



r for K, = (flat universe) 

sin(r) for JC = +1 (closed universe) . (1.3) 
, sinh(r) for K, = — 1 (open universe) 



We will mostly work in conformal time rj, defined through dr/ = a (t)dt, so that the 
FRW metric reads 

g^dx^dx" = a(r ? )(-dr ? 2 + 7ii dxW) . (1.4) 

Following the assumptions of homogeneity and isotropy, the background energy-momentum 
tensor, T^ u is bound to be of the perfect fluid form 

Tfj, u = (p + P)u fJ ,u u + Pg^y , (1.5) 

where p, P are functions of the conformal time r\ only, and represent the fluid energy density 
and pressure, respectively. The fluid 4-velocity is the timelike 4-vector u, with 

i, 0,0,0^ and = -1 . (1.6) 

We suppose that the equation of state of the fluid is of the form 

P = w(p)p, (1.7) 

where the enthalpy w(p) depends only on the local energy density. In many cases of interest, 
the enthalpy is simply a constant, in which case it is termed equation of state parameter, for 
cold, non-relativistic, pressureless matter w m = (dust), for relativistic particles w r = 1/3 
(radiation) and w\ = —1 for a cosmological constant (vacuum energy). The energy density 
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of a cosmological constant is contained in T^ Vl and is of the form pa = A/(8ttG). Another 
relevant quantity is the adiabatic sound speed of the fluid, defined as 

c 2 = P/p. (1.8) 

The Einstein equations 

= SttGT^ (1.9) 

with the FRW metric (1.4) and the energy-momentum tensor (1.5) yield the two Friedmann 
equations. The first Friedmann equation is a first order differential equation for the conformal 
Hubble parameter 7i(rj) = a/a 

H = -^L a 2 {p + W). (1.10) 

The second one is a constraint equation, 

H 2 + IC = ^-a 2 p. (1.11) 

An evolution equation for the fluid energy density follows from the component of the 
energy-momentum conservation equation, V fl T^ u = 0: 

p + m(p + P) = 0, (1.12) 

supplemented with the fluid equation of state, Eq. (1.7). If the universe contains (or is 
dominated by) only one fluid with w = const, it follows from Eq. (1.12) that its energy 
density behaves as 

poca" 3 * 1 ^, (1.13) 
hence from Eq. (1.10) the scale factor of a flat universe (K, = 0) is 

2A 



a 



l + 3w 



for w #-1/3. (1.14) 



with A 2 = 8irG/3pa 3 ( 1+w ^ = const. In particular, in the radiation dominated universe 
(w = 1/3) we have a oc r], while in the matter dominated universe (w « 0) a oc rj 2 . 

In the standard cosmological picture, the universe contains non-relativistic, pressureless 
matter (baryons and cold dark matter), photons, massless neutrinos and a vacuum energy 
component. In this case, the stress-energy tensor is the sum of the fluid components 

T" v = J2 T r- (1-15) 

a 

The Friedmann equations (1.10, 1.11) apply to the total energy density and pressure, which 
are just the sum of the contributions from each fluid. The energy conservation equation, 
Eq. (1.12), still applies to the total variables, while in general for each component we have 

V M 2r = Q£, (1-16) 

where the 4 vector Qa U describe the energy-momentum transfer from the component a. The 
conservation of total energy requires 

E^ = - (Li?) 
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In the general case, the Friedmann equations have to be solved numerically. However, we 
can easily write down solutions of simple cases. From Eq. (1.13) it follows that for radiation 
p r oc a~ 4 while for matter p m oc a~ 3 . Physically, the energy density of matter is diluted 
by the growth of the physical volume of the 3-space, while for radiation an extra a^ 1 factor 
comes in from the redshifting of the particles energy. Hence, since a is growing, at early 
enough time the universe is radiation dominated. The equality time is defined as the time at 
which the two contributions are equal, i.e. p r — Pmj after which the universe becomes matter 
dominated. Therefore 

^eq Pr 



ao Pr, 



10~ 3 , (1.18) 



'/" 



or in terms of the redshift z = a^/a — 1 we have 

z eq « 3000. (1.19) 

The subscript indicates that the quantity is evaluated today. The numerical estimate 
comes from the measurement of the present day radiation density in the cosmic microwave 
background, which together with the assumption of three massless neutrino families yields 

P, = 7.94.10- 34 (J^) 4 g/cm 3 . (1.20) 

The matter content of the Universe is obtained from the combination of CMB, large scale 
structure and supernovae type IA measurements. We shall see in § 4.2 that the CMB itself 
is a good probe to determine the redshift of equality. 

Since for a cosmological constant wa = —1, Pa = const, its contribution is negligible in 
the early universe, and indeed for a redshift 

>»{k) 3 - 1 ~ 0fJ - (L21) 

However, if A / 0, the late universe will be dominated by the vacuum energy term. In that 
case, a(t) oc exp [(A/3) 1//2 t] and the expansion becomes exponential (in physical time). 

It is customary to introduce the critical energy density as the energy density for which 
the universe is flat 

3H 2 

We also define the Hubble parameter Hq = H./ao and the fudge factor h 

H = 100/j km s^ 1 Mpc" 1 . (1.23) 
The critical energy density today then evaluates to 

p crit ( % ) ~ 1-88 • 1(T 29 h 2 g/cm 3 . (1.24) 



At all times, the density parameters tlx give the contribution of the component X in units 
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of the critical energy density: 

Slr(v) = — , (1-25) 

Pcrit 

n m (v) = — , (1-26) 

Pcrit 

n ^)^^ = -^— , (1.27) 
By definition the sum of the density parameters has to be unity 

n r ( v ) + n m ( v ) + n A ( v ) + n^v) = i • (1.29) 

The physical energy density of the component X is then given by 

px(v) = ttx(r))p ctit (r)) , (1.30) 

and in particular when evaluating this quantity at the present time we define ux = ^x{vo)h 2 
and write 

Px(vo) = 1.88 • 10" 29 g/cm 3 . (1.31) 

The definition (1.28) expresses the energy density due to the curvature of the spatial 
sections for K. = ±1. Since Qjc oc TL~ 2 oc r/ 2 , the curvature is always negligible in the early 
universe. Various cosmological observations indicate that today flic ~ 0. However, if the 
universe is not exactly flat, this would imply that at Planck time \Qjc\ ~ O(10 -60 ). The 
smallness of this number is the essence of the "flatness problem". The inflationary mechanism 
indeed naturally provides a solution for this fine tuning problem: as the universe inflates 
quasi-exponentially, its curvature is driven to 0. 

A key quantity is the angular diameter distance Da(z): consider an object of physical 
length d sitting at a redshift z\ (corresponding to conformal time r\\ and radial distance ri), 
which is observed at our present position (zq = 0, ro = 0) under an angle 9. Then the angular 
diameter distance is defined as 

d a(vi) = ^ = a(vMvo -Vi), (1-32) 

where in the second equality we have used d = Xa(r]i), with A the comoving length of the 
object, and 6 = A/x(ri), noting that r\ = r/o — rj\ since light travels on null geodesies. We 
can now integrate Eq. (1.11) to find 



1 r° da 

^■r) = r]Q- 771 



1/2 

Q r + O m \-Q?c—~+ ^a—j 



H a 

This equation is more conveniently written in redshift space 



2 4 

a „ a „ a 



;i.33) 



i r zi <\z 

Ar? = Ti— / TH ■ ( L34 ) 

H a Jo [fi P (l + z) 4 + n m (i + z) 3 + Sl K (l + zf + n A } 1/2 

Recall that the quantities fix above are evaluated at the present time. So if we know the 
physical length of an object at a given redshift, and we measure the angle subtended by 
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it on the sky, we are in principle able to extract the value of the cosmological parameters 
using Eq. (1.34). The CMB provides exactly such a standard rod on the sky: the acoustic 
oscillations of the photon fluid just before recombination have a characteristic length scale, 
which shows up as the first peak in the angular power spectrum, see § 4.1.2. The redshift of 
recombination is also known with good accuracy, hence the CMB measures with high precision 
the angular diameter distance to the last scattering surface. This piece of information alone 
is however insufficient to reconstruct completely the matter-energy content of the Universe: 
this problem is known as geometrical degeneracy, and it is explained in § 4.1.2. 



1.2.2 Boltzmann equation 

At early time, the energy density of the universe is dominated by the relativistic species, 
and to leading order we can neglect in the contribution of non-relativistic components to 
the total energy. As long as photons are in local thermodynamical equilibrium, the photon 
temperature T is related to the energy density of radiation by 

7T 2 

Pr = — g*T\ (1.35) 
where g* counts the total number of relativistic degrees of freedom 

0* = E^ + E^/j4 (!- 36 ) 

b f 

and b and / run over the bosonic and fermionic species respectively. The factors Tj, and 
Tj take into account possible temperature differences between the photons and the other 
relativistic particles. From Eq. (1.35) and p r cx a -4 it follows that while the photons are in 
thermodynamical equilibrium, T oc 1/a. 

For T > 4000K as 0.4eV hydrogen nuclei are ionized, and photons are coupled to baryons 
via non-relativistic Thomson scattering off free electrons, see § 2.2.5. As the temperature 
drops below 0.30eV, corresponding to Zdec ~ 1100, almost all the hydrogen nuclei quickly 
recombine, the mean free path of photons becomes larger than the Hubble length 1/TL: the 
universe becomes transparent. This event is called last scattering or decoupling. 

After recombination, the photon distribution function 

evolves according to the collisionless Boltzmann equation, which can be derived by requiring 
that the total derivative of / with respect to the affine parameter A vanishes 

g-0. (1.38) 

In general / = f(r],x i ,E,ri i ), where the momentum 4-vector p^ = (p°,p) is written as 

p" = ^(l,n), (1.39) 



with 



j=M n i p o = ^ = M, ( i.4o) 
a a a 

\/piP l = |p| , n l n j jij = 1 . (1-41) 
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Prom Eq. (1.38) we have 



drj dx %n dE ~^ dn l ™ ^ (^-^) 



Because of isotropy, df/dri 1 = 0, while homogeneity implies df/dx l = 0. Using the 
component of the geodesies equation 



dp" 



+ r>V = o ) (1.43) 



which in the FRW universe reads 



E+HE=0 (1.44) 
we obtain from Eq. (1.42) the background Boltzmann equation 

df df 

This equation is satisfied by any / of the form / = f(aE). We conclude that after decoupling 
the energy of the cosmic photons is redshifted by the expansion as E oc a -1 . The black 
body distribution, Eq. (1.37), retains its spectrum. The spectrum of the cosmic microwave 
background photons has been measured very accurately by the FIRAS spectrometer onboard 
the COBE satellite (Fixsen et a!., 1996), and was found to be exceedingly close to thermal. 
Deviations from a perfect black body spectrum can be measured by the Comptonization 
parameter y, the chemical potential \i and the parameter Yjj describing contamination by 
free-free emission. The 95% confidence limits on those parameters are 

|^|<9-10~ 5 , \y\ < 1.2 • 10~ 5 , \Y ff \ < 1.9- 10~ 5 . (1.46) 

After decoupling, T is no longer a temperature in the thermodynamical sense, rather a 
parameter in the distribution function, which drops as T oc a -1 . 



1.3 Cosmological observations 

It is only in comparatively recent times that cosmology has become a data driven science, 
in which theoretical hypothesis can be falsified or validated against observational data. It is 
amazing that only 15 years ago the total energy density of the universe was known with order- 
of-magnitude accuracy only. Nowadays, most cosmological parameters are constrained within 
a few percent. The discovery and accurate mapping of CMB fluctuations has constituted a 
major pillar in this evolution and represents a fundamental cornerstone of modern cosmology, 
see § 5.3 for an overview. 

It is nevertheless of equal importance that many other cosmological probes have been 
developed in parallel, and this for at least two good reasons. Firstly, all observation suffers 
in one form or in another from the degeneracy problem: only a certain combination of cosmo- 
logical parameters can be measured accurately. Since degeneracy directions are different for 
different observations, combining two or more measurements leads to tighter constrains on 
the parameters we are interested in. The second reason is that cosmologically relevant mea- 
surements are intrinsically difficult. One obvious obstacle is that there is only one universe for 



18 



Introduction 



Quantity 




Value 


Observations 


Baryon density 




0.024 


CMB, BBN, light elements abundance 


Cold dark matter density 




0.116 


CMB+LSS+SN, clusters 


A density 




0.378 


CMB+LSS+SN+weak lensing 


Hubble constant 


h 


0.72 


HST, SZ, strong lensing 


Optical depth 




0.17 


CMB 


Spectral index 


n s 


1.00 


CMB, LSS, Lyman-a, clusters 


Baryons 


n b 


0.046 




Cold dark matter 


^cdm 


0.224 




Cosmological constant 




0.73 




Radiation 


^rad 


7.95 • 1(T 5 


CMB 


Massless v families 


N v 


3.04 


CMB+LSS 


Curvature 




0.00 


CMB+LSS+SN+weak lensing 


Initial conditions 


purely adiabatic 


CMB 



Table 1.1: Parameters of today's "ACDM cosmological concordance model", which is in good 
agreement with most of the current observational evidence coming from CMB (Spergel et al., 
2003), large scale structures (LSS) (Tegmark et al., 2004b), Big-Bang Nucleosynthesis (BBN) 
(Fields & Sarkar, 2004), supernovas type la (SN) (Tonry et al., 2003), strong (Kochanek 
&; Schechter, 2004) and weak lensing (Contaldi et al., 2003), Lyman-a absorption systems 
(Seljak et al., 2003a) and galaxy clusters (Bahcall et al., 2003) observations. 

which the experimental conditions cannot be manipulated at will. Very often the interesting 
physics is hidden behind foreground emissions, poor statistical sampling, faint signals and 
non-linearities. It is common to try and extract cosmological information by using objects 
whose physical properties are poorly understood, and in general systematics are very difficult 
to assess in cosmology. Hence a cosmological measurement is usually considered as valid only 
if confirmed by one or more independent pieces of evidence. 

The so-called ACDM concordance model is strongly supported by several independent 
observational data. It is generally accepted that our universe is very close to flat (flic ~ 0); 
that it is dominated by "dark energy" ($7 a ~ 0.7), perhaps in form of vacuum energy, or 
quintessence or a tracking scalar field; that around 25% is non-interacting cold dark matter, 
and that only the remaining 5% is constituted of baryons. If the three neutrino families of 
the Standard Model of particle physics are not massless (as the large mixing angle solution 
to the solar neutrino problem seems to suggest), than their mass is bounded from above to 
be m u < C(l)eV. Structure formation proceeded by gravitational instability from quantum 
fluctuations stretched to super-horizon scale by a period of superluminal expansion (inflation). 
The simplest inflationary model, in which inflation is driven by one single slow-rolling scalar 
field, successfully predicts the absence of non-Gaussianity, the (predominantly) adiabatic 
nature of the fluctuations and the almost scale invariant spectral index (n s ~ 1) for the 
perturbations. The age of the universe, around 13 Gyrs, easily accommodates the oldest 
observed objects. For definiteness, in Table 1.1 we give the parameters of what we believe 
is a currently widely accepted "concordance model", to which we will refer throughout this 
work for illustrative and comparative purposes. 



1.3 Cosmological observations 



19 



Apart from CMB anisotropics, which we will discuss in depth in the rest of this work, we 
briefly present some of the pieces of observational evidence which corroborate the (presently) 
standard ACDM scenario. 

1.3.1 Big-Bang Nucleosynthesis 

Big-Bang Nucleosynthesis is based on the Standard Model of particle physics, and gives 
predictions for the abundance of light elements D, 3 He, He and Li synthesized in the early 
Universe, which are in good overall agreement with the observed abundances, see Olive et al. 
(2000) for a review and Fields & Sarkar (2004) for more recent results. 

Below a temperature T ~ 1 MeV the neutron-proton conversion rate falls below the expan- 
sion rate, and the neutron to proton ratio freezes out at the value n/p = exp (-Q/T) ~ 1/6, 
where Q = 1.293 MeV is the neutron-proton mass difference. The light elements production 
starts slightly afterwards, at a temperature T ~ 0.1 MeV, which is well below the binding 
energy of deuterium, Bd = 2.23 MeV because photo-dissociation prevents the formation 
of deuterium and other nuclei until then. By this time, /?-decay has further reduced the 
neutron-to-proton ratio to n/p « 1/7. The surviving neutrons end up almost completely 
in 4 He, while the abundance of the other elements is sensitively dependent on the nuclear 
reactions rates, which in turn depend on the baryon density, usually expressed with respect 
to the photon density by defining the parameter ?yio as 



where r/o is the conformal time today. A simple counting argument, see Eq. (6.16, page 136), 
yields that the primordial 4 He mass fraction is about 25%, while the number densities of the 
other elements relative to hydrogen turn out to be of the order D/H ~ 3 He/H ~ 10~ 5 and 
7 Li/H ~ 10~ 10 . The predictions are very reliable and accurate, with a residual numerical 
uncertainty which depends on the experimentally determined reaction rates; interestingly, it 
turns out that most of this uncertainty is associated with our only approximative knowledge 
of the neutron lifetime (Cuoco et al., 2003). The other free parameter of BBN is the radiation 
density in the early Universe, which sets the Hubble expansion rate and therefore determines 
the freeze-out temperature for the weak reactions and is usually parameterized with the 
equivalent number of (massless) neutrino families. We comment on the possibility of a non- 
standard number of neutrino families and discuss BBN-related issues in § 6.1.2. 

In summary, agreement between the abundance of the light elements as inferred from 
astrophysical measurement and the corresponding prediction of BBN is a powerful tool to 
verify the Standard Model of particle physics. In § 6.2.3 we present in detail the determination 
of light elements, discuss the slight discrepancies between them and the BBN predictions and 
give some possible interpretations. However, the overall agreement is satisfactory, and (for a 
standard number of neutrino families) the light elements abundances can be explained by a 
baryon density compatible with the one independently inferred from CMB, namely r/io ~ 5.5 
or u>b ~ 0.02. 

1.3.2 Matter distribution 




10 



274 • u b (rj ) 



(1.47) 



Structure formation proceeds from small inhomogeneities in the matter distribution which 
grow by gravitational instability, eventually giving rise to the large scale structures like galax- 
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ies and clusters observed today. From the determination of the statistical distribution of 
matter one tries to reconstruct the properties of the primeval fluctuations, and to validate 
the structure formation model. 

In § 3.6.3 we introduce the linear matter power spectrum P m (k), which represents the 
Fourier transform of the 2-point correlation function for the matter density contrast. Obser- 
vations of the distribution of galaxies out to a redshift z ~ 0.1 probe the galaxy-galaxy power 
spectrum, P gg ; the Sloan Digital Sky Survey, for example, currently contains approximately 
2 x 10 5 galaxies (Tegmark et al., 2004a), and upon completion will achieve 10 6 galaxies. 
The problem is then to relate P gg (k), which probes the luminous matter distribution, with 
the underlying P m (k) describing (mostly) the dark matter distribution. This is the issue 
of bias, introduced by Kaiser to explain the different amplitudes of the correlation function 
for galaxies and for clusters (Kaiser, 1984, 1987): the basic idea is that galaxies represent 
peaks of the matter distribution, and therefore our observations of P gg actually select only 
the regions of the underlying matter distribution above some threshold. This concept has 
been extended to various kinds of bias: luminosity-dependent, morphology-dependent, color- 
dependent bias, scale-dependent bias, anti-bias, and others. The simplest form is to assume 
a scale-independent bias, which seems to be justified on large (linear) scales, setting 

P gg (k) = b 2 P m (k) for k < Ri 0.3 /iMpcr 1 (1.48) 

with the bias parameter b which is just an unknown constant factor (see however e.g. Durrer 
et al., 2003a for a critical discussion). In practice, this prescription amounts to introducing 
a free parameter which controls the amplitude of the matter power spectrum. There are 
methods which allow to determine the bias from the higher-order n-point function of the 
distribution: for instance Verde et al. (2002) found b = 1.04 ± 0.11 from the data of the 2dF 
Galaxy Redshift survey (Colless et al., 2001), which plans to measure 2.5 x 10 5 galaxies. 

One can also consider the distribution of galaxy clusters as a function of redshift, which in 
principle one should be able to predict by using hydro-dynamical simulations. Comparison 
with the observed distribution would then allow to constrain the cosmological parameters. 
This simple sounding program is in practice complicated by the need of accurately simulating 
all the relevant physics, and despite the great amount of computational power nowadays 
available, recent works in the field still involve many approximations. As a result, cluster 
data mainly constrain a combination of the matter power spectrum at clusters scales and the 
value of O m , see e.g. Bahcall et al. (2003). 

Another way to probe the mass distribution is offered by the Lyman a forest, the absorp- 
tion lines in the spectra of distant quasars produced by the neutral hydrogen in regions of 
overdense intergalactic gas along the line of sight at a redshift 2 — 4 (Croft et al., 2002). Since 
the overdensities probed at these redshifts are still close to the linear regime, one hopes to 
be able to connect the observations to the matter power spectrum by modelling numerically 
the relevant physics (Mandelbaum et al., 2003; Seljak et al., 2003a). 

Weak gravitational lensing is very promising as a tool to constrain cosmological parame- 
ters, and in particular the matter distribution. It uses the distortion in the images of distant 
galaxies induced by inhomogeneities in the intervening matter distribution (Kaiser & Squires, 
1993), and reconstructs with a statistical analysis the so-called "cosmic shear" ( Wittman et al., 
2000; Bartelmann & Schneider, 2001). The technique is now rapidly becoming mature to help 
constrain the matter budget (Contaldi et al., 2003). 
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One of the most important aspects is that all of the above observations can be combined 
to achieve superior constraining power on the CDM model parameters, while testing the 
consistency of the theory itself, or the soundness of each data-set. A technique to merge 
galaxy surveys, cluster distribution, weak lensing and Lyman a data with the CMB to probe 
a larger portion of the matter power spectrum is presented in Tegmark & Zaldarriaga (2002). 
There is presently a general agreement that the matter content of the Universe is low, around 
£l m ~ 0.3. 



1.3.3 Type la supernovae 

Supernovae (SN) are classified according to their spectrum: the type la is characterized by 
the absence of hydrogen (the "I"), and by strong silicon features (the "a"). The standard 
picture is a progenitor binary system, with a white dwarf which accretes matter from its 
companion until it reaches the Chandrasekhar limit, and the gravitational infall triggers a 
thermonuclear explosion which we observe as a supernova. At the peak of its brightness, a 
SN can easily exceed the luminosity of its host galaxy, making it a promising candidate to 
measure distances out to very high (z ~ 1 — 2) redshifts. 

Their most important property is the remarkable homogeneity in their spectra, in the 
shape of their light-curve and in their peak absolute magnitude, which makes them nearly 
"standard candles". In fact, it was discovered that intrinsically brighter SNIa decline more 
slowly than dim ones (Hamuy et al., 1996). By exploiting an empirical correlation between 
the shape of the light curve and the intrinsic luminosity, and correcting for extinction effects 
via measurements at different wavelengths, it is nevertheless possible to produce a "calibrated 
candle", with a very narrow peak magnitude dispersion (Riess et al., 1996). For a review of 
the cosmological applications, see e.g. Filippenko (2004). 

The measured apparent magnitude m is related to the absolute magnitude M via the 
luminosity distance Dl 

m = M + 5log[H D L (z,n m ,n A )} + K (1.49) 

where the ".fT-correction" compensates for the difference in wavelength of the emitted and 
received photons due to the expansion, and the luminosity distance of an object at redshift 
z is defined in terms of the intrinsic luminosity L and of the measured flux i as 

/ r \V2 

D 'W<«) • (1 - M > 

The luminosity distance is related to the angular diameter distance by Dl(z) = (1+ z) 2 D a{z) ■ 
Supernovae essentially measure the angular diameter distance over a redshift range of z ~ 
0.5 — 2, much lower than range probed by the CMB. At such low redshift, the radiation 
content is negligible, and with Qj; = 1 — n m — we obtain from (1.32) and (1.34, page 15) 



H D L (z 1 ,n m ,Q A ) = x 
V \^K\ 



X (^=\ J* [C 1 + + zn rn) - Vaz(2 + z)] 1/2 dz 



11.51) 
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Figure 1.1: Illustration of the determination of (fi m , Oa) using supernova? data: the dashed 
(solid) curves are lines of constant Dl for the given measured apparent magnitude of a 
standard candle at a redshift z = 0.5 [z = 1.0). If the apparent magnitude m can be 
measured with accuracy Am = 0.05 combining the two observations gives the dark shaded 
allowed region for (Q m ,Q,\). Figure reprinted from Goobar & Perlmutter (1995). 

where the function x 1S defined in Eq. (1.3, page 12). Notice that magnitude-redshift re- 
lation (1.49) does not depend on the Hubble parameter. Therefore, assuming that we are 
able to reliably reconstruct the intrinsic luminosity M, from the measurement of one SN 
Eq. (1.49) yields one degeneracy line for the possible values of (Q m ,£l/0- By measuring a 
second standard candle at Z2 ^ z\ we are able to determine the intersection of the degenerate 
luminosity distance lines in the (fi m , Qa) plane, and thus to measure separately the matter 
and cosmological constant content. When we add the measurements error, both lines widen 
to two strips, and we obtain a region of confidence for the two parameters, independently on 
the Hubble parameter, see Fig. 1.1. 

In practice, of course, a larger number of measurements is necessary, and it turns out 
that the approximate combination Q m — ft\ is well constrained, as it is intuitively clear from 
Fig. 1.1. For instance, Tonry et al. (2003) found 



This degeneracy direction is almost orthogonal to the one in inferred from the angular diam- 
eter distance at z ~ 1100 measured by the CMB, cf. Fig. 4.1. Combination of supernova? and 
CMB data is thus a very effective way to break the angular diameter distance degeneracy 
and to constraint the matter and vacuum energy contents separately. As we have seen, obser- 
vations of the matter distribution on large scales independently constrain the matter density 
parameter: it is a remarkable achievement of modern cosmology that this "cosmic comple- 
mentarity" seems to be pointing toward the same value, namely VL m ~ 0.3 and Oa ~ 0.7, 
see e.g. Spergel et al. (2003). At the same time, the puzzle of the nature of dark matter and 
dark energy remains unsolved, and we offer some further remarks regarding the cosmological 
constant in 8 7.3. 



n A - 1.4a 



<m — 



0.35 ±0.14 (at la). 



(1.52) 



Chapter 2 

Cosmological perturbation theory 



In order to understand the physical origin of CMB anisotropies, we are interested in study- 
ing the evolution of perturbations in the photon distribution function, by perturbing at 
linear order around the "background" solution for the homogeneous and isotropic Friedmann- 
Robertson- Walker (FRW) universe of § 1.2. That linear perturbation theory is sufficient to 
describe almost all aspects of CMB physics is a consequence of the smallness of the fluctua- 
tions. 

In § 2.1 we introduce the relevant perturbation variables, discuss the issues of gauge trans- 
formations and gauge invariant formalism, extend the treatment to multiple fluids and define 
entropy perturbations. We then present the perturbed Einstein (§ 2.2.1) and conservation 
equations (§ 2.2.2) for an Universe filled with four different particle species: baryons, cold 
dark matter (CDM), photons and massless neutrinos. The Bardeen equation is presented 
in § 2.2.3, while § 2.2.4 is devoted to the derivation of the collisionless Boltzmann equation, 
which describes massless neutrinos and photons after decoupling. The last section § 2.2.5 
concerns the Thomson scattering process which couples photons and baryons before recom- 
bination, and explains the origin of CMB polarization. 

Cosmological perturbation theory in the four-dimensional FRW universe is a well stud- 
ied subject, see e.g. Kodama & Sasaki (1984); Mukhanov et al. (1992); Ma & Bertschinger 
(1995); Durrer (1994). More recently, the formalism has been extended to higher-dimensional 
manifolds, involving extra dimensions (see e.g. Riazuelo et al., 2002), in view of the recent 
interest in string theory motivated braneworlds scenarios. 

2.1 Perturbation variables 

In this section, background (unperturbed) quantities are denoted by an overline, so that 
the perturbed energy density, e.g., is denoted by p = p + 5 p. The background quantities 
depend on time only, while the linear perturbations are functions of time and of the 3-space 
coordinate, i.e. 5p = 5p(r],x). 

2.1.1 Metric perturbations 

We perturb to linear order the FRW metric of Eq. (1.4, page 12) by setting 

g^dx^dx" = g^dx^dx" + a^^d/d/ (2.1) 
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where the perturbation h^ u is given by 

h^&x^&x" = -2Adrj 2 + 2£ i0 Vd?7 + 2H ij dx i dx j . (2.2) 

The perturbation variables A, Bi, Hij are arbitrary functions of the 4-coordinate vector x^ = (77, x) 

It is convenient to split them into components which transform irreducibly under the 
rotation group 50(3). The vector field Bi can thus be written as the sum of a gradient of a 
scalar and a divergenceless component (vector) 

Bi = B\i + Bl v) , Bf )li = . (2.3) 

We split Hij into an isotropic and an anisotropic part 

= C^fij + Eij , (2.4) 

and is further decomposed in irreducible scalar (spin 0), vector (spin 1) and tensor (spin 
2) components as 

Eij = E ] i j + l(E%+E^) + E^, (2.5) 



where 



= Ey )t3 = (divergenceless) , (2.6) 
Ef j = (traceless) . (2.7) 

Note that at this stage we are still working in real space and we do not perform an harmonic 
analysis of the perturbation variables (see Kodama & Sasaki, 1984; Durrer, 1994 instead). 
At linear order, the different spin components do not mix, and we can treat them separately. 

2.1.2 Perturbations of the energy- momentum tensor 

The perturbed energy-momentum tensor is obtained by perturbing in Eq. (1.5) the energy 
density 

p = p + 5p = p(l + 5) , with 5 = 5p/p, (2.8) 

the pressure 

p = p + SP = P(l + 7T L ), with ir L = 5P/P, (2.9) 
and the space components of the observer's 4-velocity 

u* = 6u< = -— = --(«'* + (2.10) 
a a 

u° = u° + 5u° = Ul-A), (2.11) 

and the second line follows from the norm of the 4-velocity u^u^ = — 1. 
The perturbation of the energy-momentum tensor is then written as 

ST^v = (p5 + Pir L ) u^Uu + (p + P) (8u^u v + Su^Uf,) + P {-klQuv + a 2 h^ + a 2 U^) , (2.12) 
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where we have introduced the anisotropic stress perturbation rLj„, which is a traceless tensor 
and orthogonal to the 4-velocity, u^H^ = 0. It describes off-diagonal, space-space pertur- 
bations in the stress-energy tensor, and can be split into a scalar IT, a divergenceless vector 
IT- v) and a trace- free tensor part IE , according to: 

% = (v.v,- - ^v fe v fc )n + i(ng + n$) + n« , (2.13) 

The perturbation components of the stress-energy tensor therefore take the form 

ST° = -pS, (2.14a) 

5T° { = (p + P^Bi - Vi ) , (2.14b) 

5T = (p + Py, (2.14c) 

SP. = P(7> L + IT ,) . (2.14d) 



2.1.3 Gauge transformations 

By choosing the background spacetime manifold and metric to be of the FRW form, we im- 
plicitly assume that for all quantity of interest Q we are able to define a spatially averaged Q, 
which represents the background, homogeneous and isotropic value of Q on (M.,g). Consider 
now a slightly perturbed manifold, -M pert , endowed with a coordinate system x^. The value of 
Q on .M P e r t depends on the choice of the coordinate system, Q per t = Q + 5Q(x' M ). Along with 
x^, any other coordinate system which leaves g invariant is admissible, i.e. we can arbitrarily 
transform the coordinates by an infinitesimal amount 

x » _> y» = x» + 5x t " (2.15) 

thereby obtaining for Q in this newly defined coordinates 

Q P U^) - Q P Uv*) = QpertK) + CsxiQ) » ( 2 - 16 ) 

where Cx(Q) is the Lie derivative of Q with respect to the vector field X, see e.g. Straumann 
(2004). Such infinitesimal coordinate transformations are called gauge transformations, and 
the above result is known as Stewart-Walker Lemma. Fixing the coordinate system on M. peit 
is called a gauge choice. Clearly, physical observables are geometrical quantities, and are 
therefore independent of the coordinate system in which they are calculated. The form of the 
equations, however, can be very different according to the gauge choice. It is often convenient 
to fix the gauge in the way which is best suited for the problem at hand. 
The gauge transformation Eq. (2.15) can be written in all generality as 

5x° = T, bx { = L'' + L (v)i . (2.17) 

By applying the transformation law (2.16) to the perturbed metric (2.1) under a gauge 
transformation of the type (2.17), we obtain the following transformation properties for the 
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metric variables: 



A - 


+ A + HT + T, 


(2.18a) 


B - 


■* B-T + L, 


(2.18b) 


C - 


■* C + HT, 


(2.18c) 


E - 




(2.18d) 




■+ 5 (v)i + L (v)i , 


(2.18e) 




■* + L (v)i , 


(2.18f) 




-> . 


(2.18g) 



The same procedure applied on the background stress-energy tensor T^ u and 4-velocity 
gives for the matter perturbation variables: 



5 - 


■* $ - 3TW(1 + to) , 


(2.19a) 




V 2 

7r L -^(l + w;)Wr, 


(2.19b) 


n - 


- n, 


(2.19c) 


V 




(2.19d) 


„M - 




(2.19e) 


nf 5 - 




(2.19f) 


n« - 




(2.19g) 



In order to completely fix the gauge, we need to specify in Eq. (2.2) the functional form 
of two scalar functions, corresponding to a specific choice for (T,L), and one vector, corre- 
sponding to a choice for L (v) \ In the following, we briefly summarize some popular gauge 
choices. 

Longitudinal gauge 

Longitudinal gauge (also sometimes called "Newtonian gauge") is defined by requiring B = 
E = B^ 1 = 0, so that the perturbed metric element takes the form 

ds 2 = a 2 [-(1 + 2^)drf + (1 - 2$)7; i dxW] , (2.20) 

and we have defined the Bardeen potentials ^ = A and $ = — C (Bardeen, 1980), which 
represent the gravitational time dilation and the perturbation to the 3-space curvature, re- 
spectively. From any other gauge, the transformation T = B — E, L = —E and L (v) * = — £? (v)l 
leads to the longitudinal gauge. 

Flat slicing gauge 

This gauge owns its name to the choice E = C = E (v)l = 0, which makes the spatial 
hypersurfaces unperturbed, and the metric element is 

ds 2 = a 2 [-(1 + 2A)dr/ 2 + 2B i dx i dri + tyydx'dx?] . (2.21) 

The coordinate transformation which leads to flat slicing gauge is T = —C/Tt, L = —E and 
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Synchronous gauge 

In synchronous gauge, constant time hypersurfaces are orthogonal to the 3-space (hence the 
name), i.e. (r],x l ) are Gaussian coordinates. This can be obtained by imposing A = B = 
B (v)l = 0. Thus the metric presents perturbations in the space-space part only, and it is often 
written as 

ds 2 = a 2 [-Arf + + fryJdarW] , (2.22a) 
= h\fj(ri,x) + (ViVj - -7i i V jfc V fe )6r/(r ? ,x) . (2.22b) 

The above choice does not fix completely the gauge: in fact, the gauge transformation 
which leads to synchronous gauge is 

T = — [ aAdi] + - (2.23a) 
a J a 

L = J (T-B)drj + P (2.23b) 
L {v)i = - [ B (v)i dri + (3 (v)i , (2.23c) 



which presents a residual gauge freedom in the four arbitrary integration constants a and 
f3 l = ft 1 + (where /3 (v)l must be divergenceless). The four constants correspond to 
different choices of the constant time hypersurface and of the spatial coordinates on it. This 
leads to the presence of fictitious "gauge modes" in the perturbation equations, which must 
be removed because they are just an artifact of the choice of the coordinate. Despite this 
difficulty, synchronous gauge is quite popular in the literature. 



Comoving gauge 

In the comoving gauge the total bulk velocity vanishes, 6T° i = 0, which translates into 
the condition B{ = V{. In order to completely fix the gauge one further requires E = 
and E^ 1 = 0. This is achieved with the transformation T = B — v — E, L = —E and 
£M» = —^M* This gauge is the one which resembles most the gauge invariant formalism 
(defined below), since for the variables in comoving gauge we have 



c 


= -c 


see 


Eq. 


(2.26) 


6 


= D 


see 


Eq. 


(2.30) 




= A a 


see 


Eq. 


(2.37) 


V 


= V 


see 


Eq. 


(2.31) 



(2.24) 



2.1.4 Gauge invariance 

General covariance guarantees that all equations in general relativity can be written in a form 
which is independent of the gauge choice (Bardeen, 1980; Kodama & Sasaki, 1984; Durrer, 
1994). From (2.16) it follows that for all tensor fields with vanishing or constant background 
contribution, so that Cx(Q) = VX, we can construct gauge invariant perturbation equa- 
tions. Such perturbation variables are invariant under a gauge transformation of the type 
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Eq. (2.15). Since we can cast all general relativistic equations in the form Q = 0, it is always 
possible to construct gauge invariant perturbation equations (Stewart & Walker, 1974). 

This approach has the advantage of leading to equations which are independent of the 
coordinate choice, and which are often easier to interpret physically. Furthermore, gauge 
independent equations are free from spurious gauge modes. In order to write down the 
relevant gauge invariant perturbation equations, we make use of the transformation properties 
of the metric and matter variables under a change of gauge, Eqs. (2.18) and (2.19). 



Metric variables 

From Eq. (2.18) we can construct the following 4 gauge invariant metric variables: 

$ = -C - H(B - E) , (2.25a) 

y = A + H{B-E) + (B-E), (2.25b) 
E W _ £M _ B M ^ ( 2 _ 25c ) 

Hlf = E§. (2.25d) 

The two scalar variables <3? and are called Bardeen potentials (Bardeen, 1980). Another 
very useful variable is the gauge invariant curvature perturbation C, which is defined as 

C = -C + H(v-B), (2.26) 

where v is defined in Eq. (2.10). From the constraint equation (2.50), it follows that for a 
flat universe, K, = 0, the gauge invariant curvature perturbation is related to the Bardeen 
potentials by 

C = $ + ^(W* + 6). (2.27) 

n 2 -n 

There is only one gauge invariant vector perturbation constructed out of metric variables, 
Eq. (2.25c). Tensor variables are automatically gauge invariant, since there is no spin-2 
coordinate transformation. 



Matter variables 

Because of the Stewart-Walker Lemma (2.16), the variables IT, IL; and IT- 4 ' are already 
gauge invariant, since the background anisotropic stress vanishes. 

From scalar matter variables alone we can construct the gauge invariant variable 

r = n L -£s, (2.28) 
w 

which measures the intrinsic non-adiabaticity of the matter content. More precisely, as we 
shall see below, T is related to the entropy production rate. If the pressure is a function of 
the local energy density only, P = P(p), then we can write 

5P P . 
— = - 2.29 
op P 

and since by definition bp = 5 ■ p, 5P = ttl ■ P, it follows that T = 0. In the case of a perfect 
fluid, P = wp and V vanishes. Non-zero contributions to V arise from the relative entropy of 
a mixture of several fluid components, which is discussed in § 2.1.5. 
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The choice of a gauge invariant density contrast is not unique, and requires the use of 
metric variables. Meaningful combinations are 

D s = 6 - 3(1 + w)H(B - E) (longitudinal), (2.30a) 
D g = 5 + 3(1 + w)C (flat slicing), (2.30b) 

D = S - 3(1 + w)H(B - v) (comoving). (2.30c) 

On super-horizon scales, D s corresponds to the density contrast in the longitudinal gauge; Dg 
is the density contrast on homogeneous 3-space hypersurfaces (flat slicing); D reduces to the 
density contrast in the comoving gauge. The distinction is only important on super-horizon 
scales, since on small (sub-horizon) scales, all the above variables reduce to the same (Durrer, 
2001). 

The remaining velocity perturbation can be written in gauge invariant form as 

V = v-E, 

yM = (v) _ £(v' 
i i % 

Useful relations between those gauge invariant variables are 



D g = D s - 3(1 + «;)$, (2.32a) 

D = D s + 3(1 + w)HV, (2.32b) 

D = D g + 3(1 + w)C, (2.32c) 

C = <& + HV. (2.32d) 



(2.31a) 
(2.31b) 



2.1.5 Multiple fluids 

The above definitions assume that the universe is filled with, or dominated by, only one 
fluid component. In a more realistic modelling, we must account for the presence of several 
matter components. We will usually consider four of them, namely photons (subscript 7), 
massless neutrinos (subscript v), non-interacting cold dark matter (CDM, subscript c) and 
baryons (subscript b). The subscripts r (radiation) and m (matter) will refer generically to a 
relativistic (w r = 1/3) and a non-relativistic, dust-like (w m = 0) fluid, respectively. Variables 
without subscript designate the total perturbation. 

If multiple matter components are present, the total perturbation variables are the weighted 
sum of the variables for each component: 

6 = yZ ^« ' 
P 

i Pa + Pa i 

v ^ Tfp~ a 

a ' 

a 

The equation of state and the adiabatic sound speed are defined for each component 

Pq , 2 Pa 

w a = — and c a = - — , 

Pa Pa 



(2.33a) 
(2.33b) 
(2.33c) 



(2.34) 
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and for the mixture we have 

P P 
w = — and ch = — . (2.35) 
P P 

The transformation properties of the variables for each components are the same as for the 
total variables, Eqs. (2.19). Hence for each matter component we can define gauge invariant 
variables as in Eqs. (2.28, 2.30, 2.31), yielding for the scalar part: 





„ c a x 

= ir a L o a , 

w a 


(2.36a) 


Va 


= v a -E, 


(2.36b) 




= S a - 3(1 + w a )H{B - E) , 


(2.36c) 


D a ,g 


= 5 a + 3(1 + w a )C , 


(2.36d) 


D a 


= 5 a - 3(1 + w a )H(B - v a ) . 


(2.36e) 



In the presence of multiple matter components, it is often useful to work with the gauge 
invariant density contrast 

A Q = S a - 3(1 + w a )H(B - v) , (2.37) 

which corresponds to the density contrast in the gauge where the total matter is at rest, i.e. 
the comoving gauge introduced on page 27. Notice that on the right hand side it appears the 
total velocity v, rather then the velocity of the a component as in (2.36e). This new variable 
is related to the density contrast in the flat slicing gauge by 

A tt = D 9 , a + 3(l + w a )($ + W). (2.38) 



2.1.6 Entropy perturbations 

When more than one component is present, entropy perturbations can arise even for a mixture 
of perfect fluids. The total non-adiabaticity of the mixture is given by (2.28), where the 
quantities appearing on the right hand side have to be interpreted as total variables. Using 
the definitions (2.33), we obtain 

PT = PT iQt + $ap a (c 2 a -c 2 s ),= P (r int + r rel ) . (2.39) 

a 

We have introduced the total intrinsic entropy perturbation 

Tint = £ (2.40) 

a 

and the relative entropy perturbation r re i, which using the background energy conservation, 
Eq. (1.16, page 13), can be recast as 

PIrel " 2 2^ otWp — {Ca " Cp) Irr^ " rr^J ■ {2Al) 

Here we have assumed that the components are decoupled from each other, i.e. that Q u a = 
in (1.16, page 13), see (Malik et al., 2003) for a generalization to the case of interacting fluids. 
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The quantity r re i represents relative entropy perturbations which are produced by the 
different dynamical behavior of the matter components with different sound speed. The 
entropy perturbation between the components a and (3 is defined as 

Safi = -±—-*-. (2.42) 

1 + W a 1 + W/3 

It is easy to see that the entropy perturbations are gauge invariant quantities by substituting 
the gauge dependent density contrasts on the right hand side with the gauge invariant density 
variables defined in (2.37), obtaining 

Sap = ^--^. (2.43) 

1+W a l + W/3 

In order to clarify the physical meaning of S a p, consider a mixture of radiation and dust- 
like matter. We are interested in fluctuations of the number density (per physical volume) 
ratio of the two species: 

^) /(n r n m ) = - *± . (2.44) 
n m J n r Tim 

Recall that (see e.g. Kolb & Turner, 1990) n r oc s oc T 3 , with s the radiation entropy per 
volume, hence 

Sn L= 5s =3 ST = 3S_p L 
n r s T 4 p r 

For matter we have 



5n m 5p n 



n, 



and therefore 



(2.46) 



Srm ■ (2.47) 



(1 + w r ) (l + w m 

Thus a non vanishing relative entropy perturbation means that there are spatial inhomo- 
geneities in the relative number density of the the two fluids, which can be understood as a 
spatial variation in the equation of state. The above results are generalized in § 4.3. 



2.2 Perturbation equations 

In this section, we write down the first order perturbation equations using the gauge invariant 
formalism and variables defined above. For completeness, we also give the vector and tensor 
equations, but in the rest of this work we will concentrate exclusively on the scalar sector. 

2.2.1 Einstein equations 

The perturbed Einstein equations 

SG^ = MGST^ (2.48) 
are split in their scalar, vector and tensor parts. 
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Scalar equations 

There are 4 scalar equations for the 4 gauge invariant quantities 3>, \P, V and D: 

(A + 3/C)$ = 4irGa 2 pD (Poisson), 

+ 6 = 47rGa 2 p(l + w)V (constraint), 
<3? — ^ = 8-irGa 2 pwll (anisotropic stress), 



HU + (H z +2H)U = 4irGa z p ( cjD g + wF + -wAn 



where 



n 



$ -i — 

w 2 n 



(2.49) 
(2.50) 
(2.51) 

(2.52) 



(2.53) 



Recall that D g is related to D, 1/ and <3? via Eqs. (2.32, page 29), and we have assumed 
an equation of state of the form (1.7, page 12). Eq. (2.49) is the general relativistic analogue 
of the Poisson equation. In order to close this system, we need to specify the matter content 
by giving w, c 2 , V and II. For a single perfect fluid, T = II = 0, hence from the anisotropic 
stress equation (2.51) it follows that ^ = <J>. 

We shall see below that an evolution equation for II follows e.g. from the kinetic description 
provided by the Boltzmann equation, see Eq. (2.127, page 42). For multiple fluids, we will 
also rewrite T in terms of the relative entropy perturbations, as in Eq. (2.41). 



Vector equations 

The vector part yields a constraint and an evolution equation for and X^ v) : 

(2K + A + A{H - H 2 )) £< v) = 16vrGpa 2 (l + w)V^ , (2.54) 
+ 2ft£f ) = 87rGpa 2 wU^ . (2.55) 

For a perfect fluid, n| v) = 0, the above equations give in a flat universe on large scales 
(such that gradients can be neglected) 

Ef = -yWocl. (2.56) 

Therefore in the absence of active seeds, vector perturbations are always decaying on large 
scales. 



Tensor equation 

The tensor part yields an equation describing the gravitational waves. It is the equation of a 
forced harmonic oscillator, with a damping term due to the expansion of the universe: 

£g> + 2HEf] + (2/C - A)E§ = 8TiGpa 2 n% . (2.57) 

On super-horizon scales and for zero curvature, the term oc E^ is negligible. The homoge- 
neous equation in the radiation era, when 7i = r]^ 1 , has a decaying solution E^ oc r/ -1 and 
a constant solution, = const. As a mode enters the horizon, the oscillatory behavior 
takes over, and the wave propagates with a frequency k 2 + 2/C and is damped as a -1 . In the 



2.2 Perturbation equations 



33 



absence of anisotropic stress and in a flat universe, K. = 0, the general solution of (2.57) for 
IT = 0, writing = /i(x, rf)£ij (x) and going to Fourier space in a flat universe, is given by 

h = {kr l ) 1 -i [Aj^krj) + Bn^krj)] , (2.58) 

where j v (x) and n v (x) are the Bessel and von Neumann functions of order u, respectively 
(see Eqs. 3.10, page 48) and a oc rj q . 

2.2.2 Conservation equations 

The conservation equations, which follow from the contracted Bianchi identity, offer evolution 
equations which are sometimes of a simpler form and are handy to manipulate. From the 
perturbed energy conservation equation 

<5(V M r^) = (2.59) 

we obtain the following equations for a mixture of non-interacting fluids. 

Scalar equations 

There are two scalar conservation equations, one for the density contrast and the second for 
the velocity perturbation. In terms of Dg >a the conservation equations read: 

D g ,a + 3H(c 2 a - w a )D g , a = -3HT a w a + (1 + w a )AV a , (2.60) 

V a + (1- 3c 2 a )HV a = * + 3c 2 a <t> + (r a + ^D g . a + ^(A + 3/qrO . (2.61) 

Is is sometimes convenient to express the above in terms of the density contrast D a : 

D a - 3w a HD a = (A + 3/C) [(1 + w a )V a + 2Hw a n a ] + 3^^(W 2 +K)(V- V a ) , (2.62) 

c 2 w ( 2 \ 

V a + HV a = * + —^D a + — 2— r Q + -(A + 3/C)n a . (2.63) 
l + w a l + w a \ 3 / 

Vector equation 

We obtain one evolution equation for the vorticity = + : 

111 

+ «n£(i - 34) = 2 rp^ An ^ • (2 " 64) 

If the anisotropic stress source term is absent, we can rewrite the above equation as 

^(^> 1 ~ 34 ) = 0, (2-65) 

hence 

oc a 3 ^" 1 . (2.66) 
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2.2.3 The Bardeen equation 

It is often convenient to have an evolution equation for the Bardeen potential in terms of the 
total matter content. By combining the conservation equation Eq. (2.60) with the Einstein 
equations (2.49-2.51) we obtain a second order equation, called the Bardeen equation, for <1>: 

$ + 3W(1 + cg)$ + [3(c 2 s - w)H 2 - (1 + 3c 2 s )IC - c 2 A] $ = g$ , (2.67) 

where the source term g$ is generated by the matter anisotropic stress and entropy pertur- 
bation: 

g$, = 8irGa 2 P \ntl + [2H + 3H 2 {1 - c 2 s /w)]U + ±AII + |rl . (2.68) 

The above equation can be recast in an evolution equation for the gauge invariant curvature 
perturbation, Eq. (2.27). For hydrodynamical matter, i.e. setting n = and for a flat universe 
(/C = 0) we find 

t=KT-^l c2 s Aq,+ l n2wr } ■ ( 2 - 69 ) 

This expression will be used when discussing the evolution of curvature and entropy pertur- 
bations. 



2.2.4 Collisionless Boltzmann equation 

We briefly recall in the following the basics of relativistic kinetic theory, for more details see 
e.g. de Groot et al. (1980). Consider the phase space given by the the tangent bundle 

T= {Or M ,p M )|z M G M,fP G %} (2.70) 

where M. is the spacetime manifolds and T x its tangent space at the point x^. For a particle 
of mass m, its distribution function /(x^,p M ) is defined on the mass-shell 

V m (xn = G %\p^ = -m 2 } (2.71) 

The Liouville operator C is defined on T, and it gives the evolution of f(x^,p^) along the 
particle world lines, according to the Boltzmann equation 

C[f] = C[f], (2.72) 

which states that the rate of change of / is due to the collision term C [/]. For the purpose of 
studying relativistic particles such as photons and massless neutrinos, we will treat the case 
m = only. The hereby derived equations will then be applied to the description of neutrinos 
and of photons after recombination. Further details and the general case for massive particles 
can be found in e.g. Durrer (1994); Uzan (1998). 

We now proceed with perturbing the left hand side of Eq. (2.72). Its background solution 
was presented in § 1.2.2, and was shown to be of the form / = f(ap), see Eq. (1.45), where 
E 2 = p 2 = PuPug^ ' ■ By splitting the distribution function into a background and a perturbed 
part, 

f(r ] ,x i , P ,n i ) = f(i 1 ,p)+F(r ] ,x i , P ,n i ) (2.73) 

we move to a phase space which differs to linear order from the one of /. Therefore the choice 
of F and its transformation properties depend on the isomorphism relating the "background" 
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and the "perturbed" phase space. By an opportune choice of the isomorphism, it can be 
shown (Durrer, 1994) that under a gauge transformation F transforms as 

F -> F + p^-[HT + 71%] . (2.74) 



It follows that the following variable 

C + n l (Ei-Bi) , (2.75) 



77- 77 d f 

T = F -p— 
op 



is gauge invariant. In terms of the collisionless Boltzmann equation reads 

% + £»' - «< - = 4 [»'*<• + •)] • <™ 

and ^ 3 ^r*fc are the Christoffel symbols of the background 3-space. The above equation is 
in manifestly gauge invariant form, and we notice that spatial variations in the Bardeen 
potential act as source for perturbations in the distribution function. 

By integrating this equation over the particle energies, we obtain a differential equation 
for the brightness perturbation X, defined as 

I = T(rj) + l(r], x\n { ) = 4vr / fp 3 dp + 4vr / Fp 3 dp . (2.77) 

Jo Jo 

The brightness represents the energy per unit solid angle as measured by an observer at 
position x l . The photon energy is just the monopole of the brightness, i.e. 

fh = J^I, (2-78) 
and therefore p 7 = I. Prom Eq. (2.76) we obtain 

1 + (tfdi + m- ( - 3) T i jk n j n k -^j 1 = -41 + $)] . (2.79) 
The above can be rewritten in terms of the temperature contrast 

&(v,x\n i )= 6 ^ = lj (2.80) 
and using the background energy conservation equation we obtain 

6 + (n% - {3) T i jk n^n k -^j = -n%(^ + $) . (2.81) 

This is the Boltzmann equation for relativistic, collisionless particles, which relates gravita- 
tional perturbations to temperature fluctuations of their distribution function. 

The Boltzmann hierarchy 

We now go to Fourier space, and we restrict ourselves to the spatially flat case, K, = 0, so 
that the eigenfunctions of the Laplacian are just plane waves and ^r*/, = (an harmonic 
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decomposition for non-flat spaces can be found e.g. in Vilenkin & Smorodinskii, 1964; Kodama 
& Sasaki, 1984), so that for any scalar / 

f(v, x) = J d 3 kf k)e ikx , (2.82) 

and in general we denote the real space / and its harmonic transform with the same symbol. 
Defining \i = n^kj/k and k = \jkik l we obtain from Eq. (2.81) 

+ ifikQ = -tnk(W + $) • (2.83) 

Assuming that G does not depend explicitly on hi, then the dependence on the photons 
momentum direction comes in only via \x. In that case = @(rj,k,fj,), and we will suppress 
the explicit time dependence. We now perform an expansion in Legendre polynomials 1 

e^,k) = ^2(2e + i)p e et, (2.84) 
i f 1 

@e(k) = -J dfiQMPifjj), (2.85) 

where Pe(x) is the Legendre polynomial of order I, which satisfy 

Po(x)=l, (2.86) 
P l (x)=x, (2.87) 

P 2 ( x )=I(3x 2 -l), (2.88) 

(£ + l)P t+1 (x) =(2£ + l)xP e (x) - £P e -i(x) . (2.89) 

From Eq. (2.83) follows an infinite hierarchy of equations for the moments of the Boltzmann 
equation: 

S + ik@ 1 = 0, (2.90) 
Qi + \ikO + \ik& 2 = -\tk($ + *) , (2.91) 

® £+ 27TT^- 1 + 27Tr^ 0m = o {e ~ 2) - (2 ' 92) 

Gradients of the Bardeen potentials act as a source for the first moment. Because of the 
recursion relation, each multipole moment £ is coupled to the preceding and the following 
moment. Therefore, power is transferred to higher moments, and in principle we need to 
solve an infinite number of coupled differential equations. Simply truncating the hierarchy by 
imposing Qgmax = is not an optimal solution, since the error due to the truncation will reflect 
back to lower moments via the coupling. A more effective truncation scheme is discussed in 
Ma & Bertschinger (1995). We notice that at early times and super-horizon scales (i.e. 
kij -C 1) higher moments are suppressed by successive powers of kr], Qi ~ 0(&i_ikri), and 
hence the first few moments are sufficient to accurately describe the temperature fluctuation. 

1 Different normalizations for the expansion coefficient are commonly used in the literature and their relation 
with the one used here is: in Hu & Sugiyama (1995b) 6 HS = i e {21 + 1)0^ (notice that in this work the 
Bardeen potentials are such that * HS = * but <E> HS = -$); in Ma & Bertschinger (1995) 9 is denoted by 
* and = i l Q t , which is the same convention used by Seljak & Zaldarriaga (1996); in Durrer (1994) 

is denoted by M and Me = 0c/2. 
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Relations with macroscopic quantities 

Prom the definition of the stress-energy tensor (de Groot et al., 1980) 

T^(x a ) = J y?-p»p»f(x a , P > 1 ) (2.93) 

and comparing with Eq. (2.14, page 25), we can establish the hydrodynamical gauge invariant 
variables as integrals over momenta of the gauge invariant brightness perturbation: 

d ^=tJ^ j ' (2 - 94a) 



1 f dQ 

/ ' 

(1 + W-y)p-y J AlT 



Vj = - - | - - — / — n j l, (2.94b) 



U ij = — ^- [ —n^l. (2.94c) 
u> 7 p 7 J Air 

Rewriting the above in terms of multipole moments of the temperature perturbation, we have 
the identities in harmonic space 2 

e = > ( 2 - 95a ) 

0i = - -tkVy, (2.95b) 

6 2 = -^A; 2 n 7 . (2.95c) 

Truncating the Boltzmann hierarchy at the third moment by setting = for I > 3, we 
obtain 

A?,7 + tk 2 Vy = , (2.96) 

% - -D ga = - - A; 2 n 7 + $ + * , (2.97) 

n 7 --F 7 = 0. (2.98) 



Unsurprisingly, we recover the two conservation equations of (2.60-2.61, page 33) for ra- 
diation (with w y = c 2 = 1/3 and T = 0), supplemented with an evolution equation for Il 7 . 
These equations are appropriate for relativistic, collisionless and massless particles such as 
neutrinos. At later times, however, higher order moments need to be taken into account. 
Photons are scattered by electrons, and to describe their evolution we now turn to the ap- 
propriate collision term. 

2 Notice that the monopole of our T corresponds (up to multiplicative constants) to the density perturbation 
in the comoving gauge; in the literature the temperature perturbation in Newtonian gauge is often employed 
(as in (Hu & Sugiyama, 1995b)), in which case an extra term oc $ appears along with Qq . With the 
normalization convention of (Hu & Sugiyama, 1995b), the relation between our monopole and the one in 
Newtonian gauge is Go = Off — All other multipoles I > do not suffer from this ambiguity and are 
gauge independent. 
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2.2.5 Thomson scattering 

We now consider the case of elastic Thomson scattering between photons and non-relativistic 
electrons. We give some elements of the derivation for the collision term for the total photon 
intensity, while we just outline the polarization treatment. A detailed derivation can be found 
in Kosowsky (1996); Durrer (2001). 

Thomson scattering of unpolarized light generates linear polarization if the incident inten- 
sity has a quadrupolar anisotropy. In the tight coupling regime, collisions make the photons 
distribution function uniform in the electrons rest frame, and therefore no polarization can 
arise. However, during the weak coupling regime just before last scattering, the mean free 
path of photons grows and a sizable temperature quadrupole is generated, which acts as a 
source for polarization, as we briefly describe in this section. After decoupling, free streaming 
conserves the polarization state, which can only be changed by further rescattering due to 
reionization, see § 4.1.3.2. 

2.2.5.1 Stokes parameters 

The polarization state of light is usually described in terms of Stokes parameters, see e.g. 
Jackson (1975). The electric field of a plane monochromatic electromagnetic wave propagat- 
ing in the z direction is 

E(x,t) =£e<" t - kz \ (2.99) 
where the complex vector £ describing the polarization state of the wave is given by 



/ a x 



l0.r 



a y e lQ y | . (2.100) 



Instead of using the four numbers (a x ,a y ,9 x ,6 y ), it is convenient to introduce the Stokes 
parameters 

I = a 2 x + a 2 y , (2.101) 
Q = al-a 2 y , (2.102) 
U = 2a x a y cos{6 x - 9 y ) , (2.103) 
V = 2a x a y sm(6 x - 6 y ) , (2.104) 

which can be directly measured with a linear polarizer and a quarter-wave plate. Their phys- 
ical interpretation is straightforward: I gives the total intensity, Q measures the difference 
between x and y polarization, U gives phase information for the two linear polarizations, 
and V determines the difference between positive and negative circular polarization. I and 
V are physical observables independent of the coordinate system, but Q and U mix under a 
rotation by an angle 4> of the x — y plane: 

Q' = Q cos(2<£) + U sin(2<£) (2.105a) 
U' = -Qsin(2<£) + Ucos(2(f)) , (2.105b) 

from which it is easy to derive that the physically observable quantity is the polarization 
vector P, lying in the x — y plane, with magnitude (Q 2 + U 2 ) 1 ^ 2 and with polar angle 
a = I tan -1 K. 
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Figure 2.1: Geometry of the the Thomson scattering process in the rest frame of the electron, 
represented by the sphere in the center. A photon beam is incoming from the left and is 
scattered off with an angle 9. 

Finally, the four stokes parameters are not independent, but satisfy the relation 

/ 2 = Q 2 + u 2 + V 2 . (2.106) 

2.2.5.2 Scattering cross section 

We now consider the scattering process in the rest frame of the electron, with the geometry of 
Fig. 2.1. The Thomson scattering cross section for an incident wave with linear polarization 
£ m into a scattered wave with polarization £ out is 

with o"t the Thomson scattering cross section. It is convenient to work with the partial 
intensities I x and I y , defined as 

Ix^ 1 -^- and I y = LzR. (2.108) 

The incoming wave is unpolarized by assumption, so I™ = PJ 1 = I m /2, and for the outgoing 
wave we find 

jwt = £g 7 m and /° ut = ^/ in cos 2 (60 (2.109) 
or, in terms of the outgoing Stokes parameters 



/ o U t = ^T / in (1+cos 2 w)) (211Q) 
107T 



Q out = 77r-/ in sin 2 (#) , (2.111) 
U oat = 0. (2.112) 
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The value of U out has been found by recalculating Q in an outgoing basis which has been 
rotated by n/4. Thomson scattering does not generate circular polarization, so V = and 
we will not consider it further. Since from (2.106) there are only three independent Stokes 
parameters, and V = all the time, the description in terms of I and Q is sufficient, and we 
wont use U any further. 

The total outgoing intensities are obtained by integrating over all incoming directions, 
and rotating the result into a common coordinate system using (2.105): 

rout _ 3(T T f , nn . r o. 



jout = I d0 (i + cog^fl)) jm^ 0) ; (2.H3) 

167T J 

gout = ^L f dlW(fl)cos(2^)/ in (6>,0) . (2.114) 
lovr J 

2.2.5.3 Temperature hierarchy 

We are now in the position of deriving the collision term due to Thomson scattering for the 
intensity distribution function /, which is of the form 

where / + (x M ,p M ) (/ _ ) denotes the distribution of particles within (Ax^, Ap l /p°) of (x^^) 
gained (lost) in the scattering process. According to the hypothesis of molecular chaos 
(de Groot et al., 1980), the contribution lost is just proportional to the electron density 
times the photon distribution, hence with the definitions (1.40-1.41, page 16) 

^-(x^,p,n 4 ) = f/(^,p,n 4 ), (2.116) 

where 

f = aaTn e (2.117) 

is the differential Thomson optical depth, and n e is the free electron density. The contribution 
scattered into p 1 = pri 1 is most easily evaluated in the electron's rest frame, which we denote 
by a tilde. After averaging over incoming and summing over outgoing polarization states, we 
obtain 

d/ + 



-^/(p J n)w(n,e), (2.118) 



dt 

where the angular dependence of the scattered intensity is, from (2.113) 

u{s,e') = \\l-{e-E>?] = l + - A E ij eV (2.119) 

with e-ij = SiEj — \o~ij- We now transform into the coordinate system, in which the photon 
distribution function / is defined. To first order we have the relations 

p = p(l + n i (4-B i )) , (2.120) 
h = n, (2.121) 



since aberration appears only at second order. We have used the baryon 3-velocity v\, since 
electrons and baryons are electromagnetically coupled and their velocities are the same. Note 
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that the above transformation assumes vj, <C 1, i.e. that the electrons are non-relativistic, 
consistent with the fact that we consider Vb as a perturbation. Splitting the distribution 
function in an isotropic part and a (gauge dependent) perturbation, / = f(rj,p) +5f(x l ,p l ), 
we then compute the energy integrated collision term 



4vr / p 3 dpC [f] = aa T n e 



-A ni (v l b - B l )p^ + <5/5 7 - 51 (n) + -gi %3 8Iij 



(2.122) 



and we have introduced the gauge dependent brightness perturbation SI = Air J dpp 3 5f and 
its second moment 

/j o 
-^■ £ij 6I(e) . (2.123) 

The expression Eq. (2.122) can be brought in explicit gauge invariant form by substituting 
the gauge dependent variables with the corresponding gauge independent counterparts. After 
some manipulations we obtain 



/ 



4vr / p 3 dpC [/] = At 



6 n - riiVi 



e + lnyii« 



(2.124) 



where we have used the identity (2.94c, page 37). In view of adding the collision term on 
the right hand side of the hierarchy (2.90, page 36), it is convenient to rewrite it in terms of 
multipoles of the temperature fluctuation and transform to Fourier space 



4vr J p 3 d P C [/] = -Af p, 



(ikv b + 36i)Pi + -e 2 p 2 + ^2(2£ + i)e e p e 



l>3 



(2.125) 



A few remarks are in order at this point: as a consequence of the conservation of energy in 
the elastic collision, non-relativistic Thomson scattering does not contain a monopole, while 
the dipole corresponds to a velocity mismatch between photons and baryons, as is apparent 
from the first term on the right hand side with 30i = — ikV^. The angular dependence of the 
scattering generates a quadrupole moment. In the limit of very many collisions, f > W, all 
multipoles £ > 1 are driven to zero, therefore in the strong coupling regime, the photons and 
baryons velocity coincide and higher order moments are suppressed: thus the tight-coupled 
photons-baryons system can be described as an hydrodynamical fluid in term of the zeroth 
and first moments only. 

The Boltzmann hierarchy, Eq. (2.90, page 36), supplemented with the above collision term 
for photons-electrons Thomson scattering, now becomes: 



Qo + vkei = 0, 

©1 + it*(e + $ + *) + |tfce 2 = -f(hkv b + ©!) , 



2 3 9 

2 + -ikQi + -tkQ 3 = -f— 2 
5 5 10 



21+ 1 



tke 



e-i + 



£ + 1 
2£ + l 



ikO e+1 = -tQi (£ > 3) 



(2.126a) 
(2.126b) 

(2.126c) 

(2.126d) 



Rewriting the above in terms of macroscopic quantities and cutting the hierarchy at £ = 2 
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gives instead of Eq. (2.96, page 37) 



'9,1 



Day + ^k 2 Vy = 0, 



n. 



3 ' 7 

-Ik 



-f(Vy ~ V b ) , 

9 

-t— if 



10 



7 • 



(2.127a) 
(2.127b) 
(2.127c) 



2.2.5.4 Polarization hierarchy 



As discussed in § 2.2.5.2, photons scattered at a right angle are are preferentially polarized 
along the direction orthogonal to the scattering plane (i.e. in the <?° ut direction in Fig. 2.1 
when 6 = vr/2). Expanding the incoming intensity in spherical harmonics according to 



I 1 



(2.128) 



then the resulting Q on , from (2.114) is 







01 ! t. 



3<7T / 2"7T 



R e /; 



22 , 



(2.129) 



which shows that if the incoming photon intensity as a function of direction has a non-zero 
component of Y" 22 , associated with an £ = 2 quadrupolar moment, then there will be a net 
linear polarization of the outgoing distribution. 

In analogy with the intensity distribution function /, we denote by = f®{r),p) + 
x l ,p, n l ) the perturbed distribution function in phase space and by 6^ the brightness 
perturbation for the Stokes parameter Q, 

1 Jo°° f Q P 3d P 



Q Q = -- 

4 J °° FQpMp ' 

Then the collisional Boltzmann equation for the brightness perturbation 
is (Bond & Efstathiou, 1984; Kosowsky, 1996) 



e Q + ik[iO Q = -f 



(2.130) 
in Fourier space 

(2.131) 



Expanding the equation in Legendre polynomials as in Eq. (2.85, page 36), we obtain the 
Boltzmann polarization hierarchy: 



9 J + ikO® 



+ 263 



@7 + 



+ -ik 

@9 + -ike? + -ike? 
1 5 1 5 6 

.,kB Q 1 i + 1 7 j,e Q 



e 2 + + 



T 
"10 



96^ - e 2 + 



>3). 



2£+ 1 

Polarization effects also feed back into the temperature collision term, modifying the 
equation in the temperature hierarchy (2.126) as follows: 



(2.132) 

(2.133) 

(2.134) 

(2.135) 
2 



2 3 t 

@2 + ^k&i + -ik@ 3 = - — 
5 5 10 



96 



e 2 + e^ 



(2.136) 
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2.2.5.5 E and B polarization 

Prom the the hierarchy of equations (2.132) it is possible to determine the brightness pertur- 
bation for Q today, and define the corresponding power spectrum. However, the approach 
using Stokes parameters is limited by the fact that U and Q are not rotationally invariant, 
but are defined with respect to a fixed coordinate system on the sky. Not only the superpo- 
sition of different modes is cumbersome because of the behavior of Q and U under rotation, 
but the coordinate system becomes ambiguous and ill-defined on the whole sky, since it is 
impossible to define a rotationally invariant orthogonal basis on the two-sphere. 

The solution is to construct two spin 2 quantities from Q and U, which one then expands 
in the appropriate spin-weighted basis on the two-sphere (Zaldarriaga & Seljak, 1997), and 
reduces to scalar quantities by acting on them with spin raising and lowering operators. 
This manipulations yield two scalar quantities which are rotationally invariant, and therefore 
well defined on the whole sky. Furthermore, one can expand these quantities in terms of 
usual spherical harmonics and build two linear combinations which behave differently under 
parity transformation: the combination labelled E, in analogy with the electric field, is 
invariant under a parity change, while the £>-type combination changes it sign, analogous to 
the magnetic field. Another terminology, sometimes found in the literature, is C mode for 
"curl" (corresponding to the £?-type) and G for "gradient" (corresponding to the £7- type). 

Another advantage of this decomposition is that only the cross-correlation between E- 
polarization and temperature is needed, since the cross-correlation between B and E or T 
vanishes since B has opposite parity. Furthermore, scalar modes do not generate B polar- 
ization, due to the peculiar \x dependence of Thomson scattering, while tensor modes do. 
Therefore, the separation of the polarization signal in E and B modes is useful to separate 
scalar from tensor contribution, and to identify foreground contamination or a lensing signal, 
which can convert E polarization into B polarization for scalar modes. 

We do not give explicit expressions here, which are rather technical and are not needed 
in the following, but refer the reader to Zaldarriaga & Seljak (1997) instead. A similar 
decomposition, but with a different normalization has been proposed by Kamionkowski et al. 
(1997). 
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COSMIC MICROWAVE BACKGROUND 



Chapter 3 

Fundamental equations 



The all sky picture of CMB anisotropy delivered by COBE and more recently and with 30 
times more resolution by WMAP can be considered as a fingerprint of the early Universe. 
More precisely, it is an accurate reproduction of the fluctuations in the radiation-matter 
mixture at the epoch of recombination. 

In this section we succinctly explain the origin of this picture, by starting with the be- 
havior of scalar perturbations in a Universe containing one perfect fluid, § 3.1; many of the 
fundamental features of the anisotropies can be understood in a simple model with a mixture 
of radiation and matter which are coupled only gravitationally, as demonstrated in § 3.2 where 
the concepts of adiabatic and CDM isocurvature initial conditions are introduced; adding a 
massless neutrino component yields two new growing modes, the neutrino entropy/density 
and velocity isocurvature solutions, derived in § 3.3. Although the results of those two sec- 
tions are already known in the literature, the derivation presented in this work is original. 
We then refine the picture of acoustic oscillations by including baryons in § 3.4, and sketch 
the origin of damping in § 3.5. Finally we derive the line of sight solution for the observed 
temperature fluctuations today and introduce the CMB angular power spectra in § 3.6. The 
understanding and tools developed in the following will build the basis for the next chap- 
ters, where parameter extraction techniques will be discussed (Chapter 5) and applications 
presented (Chapters 6 and 7). 

There is a rich literature on the cosmic microwave background but unfortunately an up- 
dated work which encompasses both and introduction to the field and more advanced ma- 
terial, covering the rapid evolution of the last few years, is presently lacking. Throughout 
this and the next chapter we give ample references to the classic and more recent research 
papers; as background material, Lineweaver et al. (1997) is a valuable source which presents 
an introduction to the CMB theory as well as some observational issues; Durrer (2001) is 
built on a gauge invariant formalism similar to the one used here; Partridge (1995) is a good 
introductory overview written at the onset of the recent data-driven epoch. A rather complete 
review of both theory and data analysis is offered by Hu <fe Dodelson (2002). 

3.1 One perfect fluid 

We begin by examining the behavior of scalar perturbations in a flat (/C = 0) universe which 
contains a single perfect fluid, described by w = (? s = const, and T = IT = 0. 
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Since the anisotropic stress vanishes, from Eq. (2.51, page 32) it follows VP = The 
evolution of the perturbations is given by the two conservation equations (2.62-2.63, page 
33) supplemented by the Poisson equation (2.49, page 32), which in Fourier space read: 



D -3wHD = -{l + w)k 2 V , 

„2 



V + HV = 



1 + w 



-k 2 m = -n 2 D. 



(3.1) 
(3.2) 
(3.3) 



These equations can be combined into a second order equations for the density contrast: 

(3.4) 



D + (1 - 3w)HD - j^ 2 (l + 2w- 3w 2 )D + c 2 k 2 D = 



By defining a new variable x = krj and the parameter v = 2/(1 + 3w), we obtain the following 
equation for T> = Dx u ~ 2 



dx 2 x dx 



2 



V 

-2=0, 

x z 



(3.5) 



For c 2 7^ the solution is a linear combination of spherical bessel (j u ) and von Neumann 
(n v ) functions of order v (Abramowitz & Stegun, 1970) 



f = C±j u (c s x) + C 2 n u (c s x) = Z v (c s x) . 
Therefore the general solution of Eqs. (3.1) is 



D = x 2 u Z v {c s x) , 



)X 1 " + „ 2 _ ; X 2 V Z v -\{c s x) 



3v{\ + u) 



v x h ' Z v {c s x) . 



(3.6) 

(3.7) 
(3.8) 

(3.9) 



The asymptotic behavior of the Bessel and von Neumann functions is 



j v oc x v for c s x -C 1, 



j v oc — cos(c<jX — 7„) for c s x S> 1, 



-0+ 1 ) f or CsX <g J nj/ _ s in(c s x — 7„) for c s x S> 1. 



(3.10a) 
(3.10b) 



with 7,/ = 7t(z^ + l)/2. For an expanding universe (x > 0) and z/ > — 1 (i.e. w < -1 or 
u; > — 1/3) n v is divergent at early times, c s x <C 1. Therefore we set C2 = and we obtain 
the asymptotic solutions (for w > —1/3) 



D 

kV 



* = * 

2^o 2 



3 v 2 
2 



for c s a; -C 1 



(3.11) 



* x 
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and 



D 

kV 



V x 



-(!+") 



cos(c s x + J u 



--— 5- a; cos(c s x + 7 v J 



for c s x 3> 1. 



(3.12) 



(1/ - 2)* 







cos(c s x + 7„_i] 



3(1 + 1/) 

This solution was first discovered by Bardeen (1980). The Bardeen potential is constant on 
super-horizon scales, and decays once inside the acoustic horizon. On scales smaller than 
the acoustic horizon (c s x <C 1) density perturbations oscillate: the gravitational attraction is 
resisted by the fluid pressure (w / 0) and this sets up acoustic oscillations. The amplitude of 
density and velocity fluctuations remains constant inside the horizon in the case of radiation 
(u = l,w = 1/3), while it increases for w > 1/3 or w > —1/3. The behavior of the density 
and velocity perturbations on scales larger then the horizon depends on the variable under 
consideration. While D, corresponding to the density contrast in the comoving gauge, is 
growing, the density contrast in the flat slicing gauge D g remains constant. Therefore there 
is no universal criterion to establish the growth of perturbations outside the horizon: the 
behavior depends on the chosen gauge. As we go to early times, x — > 0, perturbation theory 
remains valid as long as it is possible to find a gauge in which the largest perturbation variable 
does not diverge. We come back to this point in § 4.3, where we derive the most general 
initial conditions. 

The case of dust w = c 2 = has a power-law solution on all scales. It suffices to remark 
that Eq. (3.5) reduces to 



0. 



whose general solution is D 



dx 2 xda; x* 
Ax 2 + Bx~ 3 . The growing exact solution is therefore 



(3.13) 



D 



6 U 



for dust, w = 0. 



(3.14) 



1 



kV = -^nx oc a 1/2 
3 



Clearly, in a dust universe perturbations always grow on sub-horizon scales, since there is no 
pressure to counterbalance the gravitational attraction. 



3.2 Cold dark matter and radiation 

In this section we investigate the evolution of perturbations in a flat universe containing only 
radiation and a pressureless matter component which is decoupled from radiation. Thus the 
matter has only a gravitational effect and represents a cold dark matter component. In the 
next section we include massless decoupled neutrinos in the picture, while the role of baryons, 
which are coupled to photons via Thomson scattering, is investigated in § 3.4. 



3.2.1 Adiabatic and isocurvature modes 

In this section we use as density variable the density contrast in the total comoving gauge 
A Q , defined in Eq. (2.37, page 30). We identify the radiation with photons (subscript 7), 
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and we have w 1 = c 2 = 1/3, while for matter w rn = (? m = 0. We normalize the scale factor 
at the matter-radiation equality, so that 

Pm( a eq) = P-y( a eq) with a eq = 1 hence = a • (3.15) 

The total equation of state parameter and sound velocity are therefore 

^ = Q TT aild °S = Q /I i Q ■ 3 ' 16 

3a + l 34 + 3a 

As long as we are considering times well before decoupling, the photons form a tight coupled 
fluid with baryons, since Thomson scattering prevents the generation of anisotropic stress 
(and higher multipoles in the Boltzmann hierarchy) in the photons component, ILy = 0, as 
we show in § 3.4. Therefore, via the anisotropic stress equation (2.51, page 32), the Bardeen 
potentials are equal, * = The Bardeen equation for <1> (2.67, page 34) is then 

$ + 3W(1 + c£)$ + 3(c 2 - w)H 2 $> = c 2 s A$ + lH 2 wT , (3.17) 

where V = T re \ is related to the relative entropy perturbation S = 5 m7 = A m — |A 7 by 
Eq. (2.41, page 30). By using the Poisson equation we can rewrite the above as an equation 
for the total density contrast, 



■Hr 2 b + (1 - 6w + 3c 2 s )n^D - |(1 + 8w - 3w 2 - 6c 2 )D 

H 



-c 2 



[D-3c 2 {l+w)S] 



where we have introduced c 2 = p^pm^c 2 — c^J/ [(1 + w)p] = a /(3a + 4). 

The energy conservation equation (2.60, page 33) reads for the radiation and matter 
components: 

A?,7 + \ k2y i = (radiation), (3.19) 
D g ,m + k 2 V c = (matter). (3.20) 
Subtracting (3.20) from (3.19) and using that 

7^-7^ = T^-7^-=S«/» ( 3 - 21 ) 

1 + W a l + Wp l + W a l + Wp 

we obtain 

S = -k 2 (V m - Vy) . (3.22) 

In order to find an evolution equation for the entropy S, we derive (3.22) and making use of 
the momentum conservation equation (2.63, page 33) after a lengthy manipulation we arrive 
at 



H~ 2 S + (l-3c 2 z )H- l S 



-D - c 2 S 



(3.23) 



.3(l + u;) 

Together, Eqs. (3.18) and (3.23) describe the evolution of adiabatic (curvature) and isocur- 
vature (dark matter) perturbations in a flat universe containing only dark matter and radi- 
ation. 
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We start by considering large scales (k <C TL) at early times, a — ► 0. Then the right hand 
side of (3.18) and (3.23) is negligible, thus D and S are decoupled. Using the scale factor a 
as variable, we obtain an homogeneous system 

j2 



2 n 



2D = 



2 d c , d c 

a -r-^S + a—S 
dcr do 



(Large scales, radiation epoch) 



(3.24) 







whose general solution consists of four modes 

f D 



D a 2 + D ia - 1 



S = So + S\ In a 



(3.25) 



We will call the mode with Do / 0, D\ = So = S\ = the growing adiabatic mode, while the 
one with So 7^ 0, Do = D\ = S\ the growing isocurvature mode (notice that for a < 1 the 
Si mode is indeed decaying). As we show below, the isocurvature mode at early times has 
vanishing total density contrast, Bardeen potential curvature perturbation, C, = 0, hence its 
name 1 . 

Consider first the growing adiabatic mode: we can now restore the solution for D in the 
source term on the right hand side of Eq. (3.23) to find the solution for S up to second order 
in k/TL. The Bardeen potential is easily found from the Poisson equation, and the result is 

,2 



D 



S 



kV 



D a 

Do 
64 



a oc a 



3D 



IT) 



I — const (adiabatic, radiation epoch). 



(3.26) 



1 k * 
-— $ oc a 

2H 



C 



9D 



Ha 



const 



Clearly, we recover the behavior already found in the single radiation fluid case for the 
potential. We also discover that the entropy perturbation grows as a 4 , but remains negligible 
on large scales, thus the adiabaticity condition S w is maintained on large scales. 
For the growing isocurvature mode we find, to the same approximation 





D = 


s fky 




S = 


So 


< 


$ = 


So 

a 

8 




kV = 


So k 

a oc a 

8 H 

3So 

a 

16 




C = 



(isocurvature, radiation epoch). 



(3.27) 



lr Tlie CDM isocurvature mode is sometimes termed "isothermal" in the literature: this comes from the fact 
that D — implies 4jr = — ^-A m m at early times. Intuitively, it takes only a small perturbation in the 
radiation component to compensate for a fluctuation in the matter at early times, because the Universe is 
radiation dominated. 
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We see that there is no generation of entropy on large scales (S = 0), however the isocurvature 
condition <E> « is maintained only as long as a <C 1. Naively we would expect that, as long 
as the scale considered is outside the horizon, the term containing S on the right hand side of 
Eq. (3.18) is suppressed as k 2 /H 2 , thus D (hence $) should not grow significantly. However, 
since $ oc 7i 2 /k 2 , effects of magnitude k 2 /H 2 in D are significant for <E>. This can be seen 
more directly by rewriting the right hand side of Eq. (3.17) as —c 2 k 2 /TC 2 Q — 2(1 + w)c 2 c 2 S. 
Therefore even on super-horizon scale the term oc S act as a source for whenever c 2 c 2 is 
significantly non-zero. This is the case during the transition from the radiation to the matter 
dominated epoch. 

Having established the behavior in the early epoch, we now turn our attention to scales 
which enter the horizon when the universe is well matter dominated, i.e. to wavelengths such 
that 

k < k eq = H{a eq ) . (3.28) 

The effects of the radiation-matter transition are easiest to discuss by looking at the behavior 
of the curvature perturbation £. To this end we rewrite the evolution equation (2.69,page34) 
as 



C = ~c 2 s H 



2 k 



3(1 + \n 



$ + 3c 2 S 



(3.29) 



The term oc $ on the right hand side is always negligible on super-horizon scales (k/TL <C 1); 
for adiabatic perturbations we also have 5 = 0, and thus we obtain 

( = const (adiabatic, all times), (3.30) 

the usual conservation law for ( in the adiabatic case. For the isocurvature mode (S = So = 
const) we find by integration 

C = — 3Sb / — c 2 c 2 — ► — So (isocurvature, matter epoch). (3.31) 

Jq a a-^oo 3 

The radiation-matter transition generates a curvature perturbation from the initial isocurva- 
ture one, and this even on super-horizon scales. 

Since C = const in the matter era independently on the initial conditions, we can find 
the value of the Bardeen potential in the matter epoch simply by integrating the definition of 
the curvature perturbation, using that w = const as well. We then obtain the relation (valid 
only in the regime where £ = const, w = const) 

3(1 + id) 5+3w 
5 + 3w 

and we can drop the second term, which is decaying for w > —5/3. Therefore 

$(a 3> a eq ) = const = |C (matter epoch, independent of IC). (3.33) 

For the adiabatic mode, £ = const in the radiation era as well, therefore we can apply (3.32) 
with w w const = 1/3, getting 

<l>(a <C a eq ) = const = |C (radiation epoch, adiabatic). (3.34) 
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Let us denote by $o the value of <1? at the moment when the initial conditions for the 
perturbations are specified, deep in the radiation era. The adiabatic mode corresponds to 
So = 0,$o / 0, while the isocurvature mode has So / 0, <3?o = 0. From (3.33) we know that 
$ is constant on super-horizon scales in the matter era, independent of the type of initial 
conditions; we denote its value by <3?md, and we wish to express it in terms of Soj^o- For 
adiabatic perturbations, ( stays constant through the transition, and therefore combining 
(3.33) with (3.34) 

$md ~ in^o (adiabatic, large scales). (3.35) 

For isocurvature perturbations, the growth of C through the transition gives a non-zero 3> in 
the matter epoch, from (3.33) and (3.31) : 

$md ~ — <So (isocurvature, large scales). (3.36) 
5 

In conclusion, we can summarize our results in terms of a transfer matrix as 

(3.37) 







a>a eq 

It is often useful to use the curvature perturbation as a variable describing the adiabatic 
mode, instead of In terms of the initial values of the curvature and entropy perturbations, 
(Co; So), the final values in the matter era are given by a transfer matrix of the form 

(3.38) 

a>a eq 

From the above analysis, we conclude that for scales k <C k eq the transfer coefficients are 

% = 1, T CS = ~, T ss = l. (3.39) 

For smaller scales, which enter the horizon before the universe is completely matter domi- 
nated, the coefficients have to be found numerically. 

3.2.2 Acoustic oscillations 

We have seen in § 3.1 that perturbations in a fluid of photons oscillate on scales smaller than 
the horizon. We now discuss the corresponding behavior in the presence of matter, and link 
the phase of the oscillations to the adiabatic or isocurvature initial conditions on large scales. 

Neglecting the anisotropic stress, II 7 = 0, the conservation equations (2.60-2.61, page 33) 
for photons read 

D gri + ±k 2 V y = (3.40) 
V 1 - \D g>1 = 2$ (3.41) 

where <3? can be considered as an external potential determined by the Poisson equation. We 
can recast the above in a second order equation for the density perturbation: 

D gn + c?k 2 D gn = 2$ . (3.42) 
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Adiabatic initial conditions 

Let's consider Eq. (3.42) deep in the matter era, when the driving force is just a constant set 
by the dominating matter contribution in the adiabatic case. Then the general solution of 
Eq. (3.42) is 

A/,7 = C\ cos(c 7 /c7?) + C 2 sin(c 7 /c7?) - 8$> (3.43) 
kV^ = - — [C\ sm(c^krj) — C2 cos(c 7 £;7?)] . (3.44) 

For small scales, where all choices of density perturbation are equivalent, we recover the 
oscillatory behavior already found in § 3.1. The density perturbations perform harmonic 
oscillations around a zero point displaced by a constant factor. 

The constants C± and C2 are fixed by the initial conditions, adiabatic or isocurvature, 
established by matching the above solution on large scales with the results of the previous 
section. To this end, we shall use the following relation between D gn and A 7 , which follows 
from the definitions of the variables: 

\D ga = ±A m - iS - HV - $ . (3.45) 

From the momentum conservation equation (2.63, page 33) we obtain for the total velocity 
perturbation in the matter era 

V + HV = <f>, (3.46) 

with solution 

V = V 1 a~ 1 + -H~ 1 ^. (3.47) 
o 

The term oc a -1 is decaying, therefore we retain V ~ Inserting this into Eq. (3.45) 

and using that in the matter era <3? = 9/10<I>o — So/5 we obtain on large scales, where 
A m ~ {k/H) 2 § < $, 

\D ga (a > a eq ) « const = -|$o • (3.48) 

Thus on large scales and in the matter epoch, D ga is independent of the entropy perturbation, 
and is simply related to the primordial Bardeen potential. 

The adiabatic mode stays decoupled from the isocurvature mode on super-horizon scales, 
therefore we can set the initial conditions for the solution (3.43-3.44) by taking its constant- 
time super-horizon limit, i.e. k — ► 0, 77 = const S> r] eq . This gives, with So = 

\D g „ = \C X - 2$ MD (3.49) 

and comparing with Eq. (3.48) and using again (3.35) we obtain 

Ci = |$md • (3.50) 

The constant C2 is set by noting that the adiabatic condition S = is preserved on 
super-horizon scales, and that, because of energy-momentum conservation for matter and 
radiation, this implies 

V J = V m . (3.51) 

Since 

^ir^+irk 1 ''" (3 ' 52) 
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we have that V ~ V m for a 3> a eq , and with (3.47) it follows that 



V 7 = V n , 



Comparing this with the large scale limit of Eq. (3.44), 



lim ^7 = T 

k— >0,??=const 4 



Ci - C 2 lim 



cos y 



y^O y 



(3.53) 



(3.54) 



we see that we need to impose C 2 = 0, otherwise Vy would diverge in the large-scale limit 
y — ► 0, and we recover again (3.50) by using TL = 2/rj\ 



In conclusion, the adiabatic solution is 

4 



Ci = |$md and C 2 = 0. 



D 



!)■- 



<l>cos(c 7 fe?7) -8$ 
3 (adiabatic). 



(3.55) 



(3.56) 



kVry = c 7 $ sin(c 7 A:r/) 
Isocurvature initial conditions 

As we have seen in the previous section, $ = is no longer maintained in the matter era for 
isocurvature initial conditions. It is therefore convenient to solve (3.42) at early times in the 
radiation regime, where we know that the driving term on the right hand side is <1> oc rj (cf. 
Eq. (3.27, page 51)): 



D gn = C\ cos(c 7 /cr/) + C 2 sin(c 7 /cr/) - -fe 2 rj eq 1 S ri , 

1 9 

kV 1 = — [Ci sm(c.ykri) - C 2 cos(c 7 /cry)] + — k^rj^So ■ 



16 



(3.57) 
(3.58) 



The constants C\ and C 2 are determined by looking at the early time limit on super- 
horizon scales, rj — > 0, k = const <C k eq . From the early-times solution (3.27) we have that 
D ga — > for rj — > 0, and therefore we need to set C\ = 0. The early time limit for Eq. (3.58) 



gives 



C 2 

rj— >0,fc=const 4u 7 



lim fey, 



9 



~ A 1" 77 k 3r 1ea S ) 



' 16" ,eq 

while from the isocurvature solution (3.27) combined with (3.52) we have for a <C a eq 



lim 



fey 7 = feV oc rj 1 -> . 



ij— >0,fc=const 

By requiring that the left hand side of (3.59) vanishes we conclude that 

C 2 = -^-fe- 3 r ?ea 1 5 . 



(3.59) 



(3.60) 



(3.61) 



In conclusion, isocurvature initial conditions excite a sine oscillation in the radiation den- 
sity: 



D 



an 



fey 



fe 2 rj e qSo V3k sin (c 7 fer/) — rj 



4" 'eq 

3^ 



16 



fe 2 r/ eq 1 5'o v^fe cos (cjkrj) — 1 



(isocurvature). 



(3.62) 
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An heuristic argument (Hu & Sugiyama, 1995b) explains why adiabatic initial conditions 
excite the cosine mode while isocurvature initial conditions produce the sine mode: at early 
times, the potential acting as a driving force on the right hand side of Eq. (3.42) is constant 
for adiabatic initial conditions, while it is cx r\ in the isocurvature case. This mimics a cosine 
and a sine forcing term, respectively, and thus the corresponding modes get excited. An 
approximated analytical solution valid until recombination and through the radiation-matter 
transition can be found in Hu & Sugiyama (1995a). 



3.3 Neutrinos and initial conditions 



In this section we extend the above treatment to include massless neutrinos. They are 
described as an additional relativistic component, which is decoupled from the others below 
a temperature of a few MeV, and therefore their distribution function obeys the collisionless 
Boltzmann equation. We shall see in the following that the anisotropic stress created by free 
streaming of neutrinos considerably complicates the simple picture of the previous section. 

By including one more component in the mixture, we generally expect two additional 
modes to arise, which we will be able to identify with the so-called "neutrino isocurvature 
density" (NID) and "neutrino isocurvature velocity" (NIV) modes. In the following, we shall 
refer to both of them as to "neutrino isocurvature modes" 2 , and we will sometimes call the 
neutrino density mode "neutrino entropy", which is a more appropriate definition in our view. 
These two modes were first found by Bucher et al. (2000), who solved a formal expansion in 
powers of r\ of the Einstein and conservation equations at early times and on large scales (i.e. 
for -qk — > 0) in synchronous gauge, an analysis repeated in the gauge invariant formalism in 
Trotta (2001). The approach we propose here offers a more physical understanding and the 
approximations we employ could be extended to a refined analytical model of the sub-horizon 
structure of the neutrino modes angular power spectra. We explicitly give some details of 
the derivation, since to our knowledge this calculation is new. 

We argue in § 3.3.4 that an "anisotropic stress mode", which is characterized by a non- 
vanishing Tl u at early times, is non-physical, since it leads to incurable divergences in the 
perturbation variables. 

3.3.1 Evolution equations for a three components model 

In the presence of neutrinos, the background radiation energy density is written as 



Pr = P-y + Pv = P 7 (l + r u ) , (3.63) 

where we have defined the constant r u = (7A y /8)(4/ll) 4 / 3 « 0.68 for N v = 3 neutrino 
families. As before, the scale factor is normalized to matter-radiation equality, the conformal 
Hubble parameter is 

= Uj£ = (1 + 7a^ 

r] + rj 2 /4: 7a K ' 



2 The term "isocurvature" is somewhat abused for the neutrino density mode, see the remark after Eq. (3.93) 
on page 61. We nevertheless employ this terminology for simplicity and consistency with the literature. 
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and the cosmological parameters as a function of the scale factor are of the form 

VL v (a) 

Q-y(a) 



(l + r„)(l + a) 
1 

(l + r„)(l + a) 
a 



(3.65) 
(3.66) 
(3.67) 



(1 + a) 

We still neglect the dynamical effect of baryons, which to lowest order is unimportant, but 
continue to assume that Thomson scattering drives to zero all multipoles I > 2 in the Boltz- 
mann hierarchy for photons, which are then described as a relativistic perfect fluid. Neutrinos 
become collisionless after neutrino decoupling, therefore the fluid approximation is insuffi- 
cient. A neutrino anisotropic stress is generated by free streaming and to lowest order we cut 
the Boltzmann hierarchy for neutrinos, Eq. (2.96, page 37), by setting to zero all moments 
> 3. The goal is to derive second order evolution equations for the three relevant and physical 
quantities: the total density contrast D, the entropy perturbations in the dark matter, S m7 , 
and in the neutrinos, S U j, supplemented by an evolution equation for the neutrino anisotropic 
stress. 

The source term in the Bardeen equation is modified in two ways: there is an additional 
entropy contribution coming from the neutrino entropy perturbation S u ^, and we have to 
take into account the anisotropic stress term. This gives for the evolution equation of the 
total density contrast D (compare with (3.18, page 50)) 



H~ 2 D + {l-6w + 2,c 2 s )TL~ l D - |(1 + 8w - 3w 2 - 6c 2 s )D 




ciD-3c 2 s c 2 z (l + w){S,, 



J rrvy 



-s, 



+ 



2rv 



3(l + r„)(l + a) 



1 + r„ 

ntl u - [(l + 3w) -3c*|' 



1 



Equation (3.23, page 50) acquires extra terms coming from S u -y, reading 

1 



D 



r 2( l 



4wr h 



(3.68) 



-S v 



3(1 + ™)" " z " mi 3(l+u/)(l + r„) V1 

(3.69) 

In deriving the above equations we have made use of (2.38, page 30) and (3.22, page 50) 
together with the following useful relations: 

-D —r. z r~7~ ~Syy - ■ zr-S m -y , (3.70) 



1a. 



fcVv 



3(1 +w) 
kV 



3(4 + 3a)(l + r, 
4r„ 



4 + 3a 
3a 

'm 



-iy n 



V, 



(3.71) 



(4 + 3a)(l+7v) v " 4 + 3a' 

We obtain an equation for the neutrino entropy perturbation by deriving the difference of 
the momentum conservation equation for neutrinos (Eq. (2.97, page 37) written for v instead 
of 7) and the momentum conservation for the photon fluid, (3.19, page 50), with the result 



h 2 

Q +—Q - _TT 

"J I/7 T ^ Ul/7 g '-'-V 



(3.72) 
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The coupled system (3.68), (3.69) and (3.72) describes the evolution of adiabatic and 
entropy perturbations in a mixture of photons, dark matter and radiation, once we specify 
Tl u . However, on super-horizon scales and for early times, k/TL -C 1, the anisotropic stress is 
unimportant, since from (2.98, page 37) written for v instead than for 7, it obeys 



d , 2tt 8 k , Tr 
a—kU u = --kV v 
da 5 n 



0. 



(3.73) 



which shows that on super-horizon scales there is no generation of anisotropic stress, a result 
expected on the grounds of causality arguments. At earlier times, the neutrinos were coupled 
to electrons via weak interaction processes, which isotropized the neutrino distribution func- 
tion suppressing any appreciable anisotropic stress; hence we can assume that at the time 
under consideration (just after neutrino decoupling) there is no anisotropic stress to zeroth 
order in powers of a, i.e. Tl u = 0(a) at least. 

In the above approximation and for aClwe thus obtain the simple system 

,d 2 



2D = 0. 



x_ d Q 

+ a—b. 
da 

2 d 2 



a 



m-y 



d 

a—b, 
da 

0. 



(3.74) 



whose general solution consists of six modes 

D 



D a 2 + D x a~ l 



s 



So + Si In a + N v a . 
S U1 = N d + N v a . 



(3.75) 



We recognize the growing and decaying adiabatic (the Dq and Di terms, respectively) and 
isocurvature dark matter (So an d Si terms, respectively) modes, and we also find two new 
non-decaying modes, a constant neutrino entropy mode A^^, and a neutrino velocity mode 
N v a (the reason for this terminology is explained below). 

In order to go beyond this large scales solution, we need to include the effect of the 
anisotropic stress. To this end, we recast Eq. (3.73) by substituting kV u with 

3(1 + r u )a d 



kV v = kV 



aH 



1 + r v da 1/7 



"da Sm7 



(3.76) 



From now on we drop the last term on the right hand side, which is always suppressed by a 
power of a except in the dark matter isocurvature case, which we do not investigate further 
here. For the total velocity, the constraint equation (2.50, page 32), combined with the the 
anisotropic stress equation (2.51, page 32) and the Poisson equation (2.49, page 32) yield, in 
the early time a <C 1 limit 



kV 



H 

T 



-D 



3a d 



D 



-k 2 U u 



A 4 da l + r u 
The evolution equation (3.73) for the anisotropic stress then reads, for a <c 1 



a— k 2 U u + - 



da 



51 + rv 



-k 2 U,. 



5 5 da 



8a 



5(1 + r v ) da 



S 



(3.77) 



(3.78) 
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In the same limit and in terms of the scale factor a, the equations for D and S V1 become 
(dropping the last term oc k 2 H u on the right hand side of (3.68) which is always negligible 
compared to the others): 



a 2 ^D 
d 2 a 



2D 



S, 



2r„ 



3(1 + r v y V ^ 



3(1 +r„) 



a— k 2 U v 
da 



2rn, 



6 v n 



(3.79) 
(3.80) 



The system of coupled differential equations (3.78), (3.79) and (3.80) is too difficult to solve 
analytically. To find an approximate solution valid to leading order in powers of a for early 
times, we treat the anisotropic stress iteratively as a perturbation to the large scale solution, 
Eq. (3.75), in analogy with the procedure in Hu & Sugiyama (1995a). More specifically, we 
use the large scale solution for D and S U1 as a source on the right hand side of Eq. (3.78) to 
determine the anisotropic stress, then we re-insert the solution for Ii u on the right hand side 
of (3.79) and (3.80) to find self-consistent corrections to the large scale behavior. 

As an illustration, let us first consider the adiabatic growing mode, D = Dqo 2 , D± = Sq = 
Si = = N v = 0. In that case, the right hand side of (3.78) is dominated by the terms in 
D, giving 



^ + 5ttt: 



k 2 U v 



6 -D a 2 
5 



(3.81) 



which has the particular solution 



k 2 n L 



3(1 +r u )D 2 

a 

7r u + 5 



(3.82) 



Notice that, although the above form of Yl u oc a 2 is of the same order as the adiabatic solution 
D oc a 2 , its contribution on the right hand side of (3.79) cancels out because of the factor 
2 in the exponent. Thus it is consistent to have neglected the anisotropic stress in the first 
place when deriving the large scale solution. 

With the above approximation for 11^, from (3.80) we can determine the growth of neutrino 
entropy perturbations in the adiabatic mode, finding to leading order in powers of a 



S, 



1/7 



(l+r„)A) fk_ 
48(7^ + 5) \H 



a oca «D. 



(3.83) 



The growth of the dark matter entropy perturbation is also modified by the coupling to 
the neutrino entropy perturbations on the left hand side of (3.69, page 57), but the term 
oc S U1 oc a 4 has the same scaling as the term oc D on the right hand side, and the approximate 



solution is 



J m-y 



1 
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1 



Do 



a 2 oc a 4 <C D . 



(3.84) 



3(7?v + 5) 

In conclusion, the growing adiabatic mode at early times in the presence of neutrinos and 
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anisotropic stress has the approximate solution (compare with the solution (3.26, page 51)): 



D = Dna z 



S m *y OC 



2 4 

a oc a 



S u -f oc ( — I a oc a 



H 



k II,, oc a 



3 / ka 

2 1 n 



Dq = $o = const 



* = $o + 



3r u ( ka 



7(1 + r„) \H 



I — I = = const 



1 k 

kV = $o oc a 

27i 



C 



9D (Ha 
T 



const 



(adiabatic). 



(3.85) 



The Bardeen potentials are no longer equal due to the anisotropic stress, the fractional 
correction being 



2 TV 



71 + r, 



0.1, 



(3.86) 



of order 10%, in good agreement with Hu & Sugiyama (1995a) 



3.3.2 Neutrino entropy mode 

Let us now turn our attention to the Nd / mode, with N v = Dq = D\ = Sq = S% = 0: this 
is clearly a neutrino entropy mode, since S U7 = const for a — ► 0. 

To determine the growth of perturbations in the total density D beyond the large scale 
solution D = 0, consider the right hand side of Eq. (3.79): if the anisotropic stress goes at 
least as a 2 , then the part containing n„ cancels (for Ii u oc a 2 ) or is subdominant with respect 
to the S U1 term (for Il„ = C(a 3 ) or higher). In any case, we can neglect the anisotropic stress 
term as a source for D with respect to the neutrino entropy perturbation, with the caveat 
that at the end of our calculation we have to check that this assumption is satisfied - indeed, 
cf. Eq. (3.89). By this argument, we look for a particular solution of 

a^D - 2D = - (£) 2 ^-r^-iV, , (3.87) 
d z a \H J 3(1 + TV ) 



which is given by 



g = " 9(1 + ^) ^(^1 ln(a)«a 2 ln(a). (3.8? 



The logarithmic dependence can be neglected if we do not apply this solution over a too 
large time range (say, less than a few orders of magnitude), and replaced by the value of 
ln(a) evaluated at the typical value of the scale factor in the range considered, a*, which we 
reabsorb in the overall normalization by defining a new constant iVJ = iVdln(a*). 
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We can now solve for IIj, by inserting the above expression for D in Eq. (3.78, page 58), 
and observing that on the right hand side = 0, thus obtaining 

k2n "= N '*($w^« a2 - (3 - 89) 

which is consistent with our initial assumption for IL,. 

Finally, the Bardeen potentials follow from the Poisson equation and the anisotropic stress 
equation, yielding 

$ = J^ d \ = const > ( 3 - 9 °) 
6(1 + r v ) 

* = $ ( 1 - Vv ) = const . (3.91) 

The gauge invariant curvature perturbation ( is given by (2.27, page 28) and it can be 
rewritten as 

3 r . o d r r v CH\ , 2l 



yielding for the neutrino entropy mode 

C = - J1 -^- ; r = const . 3.93 

This results agree with the power law solution found by Bucher et al. (2000), which they 
called "neutrino isocurvature density" mode; we prefer however to term this mode "neutrino 
entropy", since the initial curvature perturbation does not vanish, and indeed is of the same 
order as the entropy perturbation. 

3.3.3 Neutrino velocity mode 

The mode with N v ^ has vanishing entropy at early times, since S V1 — > for a — ► 0, but 
the bulk velocity difference between neutrinos and photons in non-zero, 

k(V u - Vy) = = const (3.94) 

rv 

hence its name. 

From the power-law solution for this mode (see Bucher et al., 2000; Trotta, 2001) we 
expect that the anisotropic stress goes to leading order as IT^ oc a. Indeed, by replacing the 
large-scale solution D = 0, 5„ 7 = N v a on the right hand side of (3.78) we find the particular 
solution 

k 2 n» = -7^h*- (3-95) 
9r„ + 5 

We now use this expression as a source on the right hand side of (3.79) to determine the 
corrections to D, and we can ignore the contribution of the term oc S U1 which goes as a 3 
compared to the part containing 11^, which is dominant, being proportional to a. We thus 
have to solve 

n d 2 _ _ _ 16r„N„ 
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and we find the particular solution 

D 



8r v N v 



3(l + r„)(97v + 5) 

As already noticed in Bucher et al. (2000), the Bardeen potentials are decaying 



4r v N v 



H 

(l + rv)(9rv + 5) \~k 



— a oc a 



(3.97) 



(3.98) 
(3.99) 



but this does not necessarily mean that perturbation theory breaks down for a — > 0. In 
general, a solution is considered non divergent if it is possible to find a gauge in which all the 
perturbation variables do no diverge in the limit a — ► 0. The synchronous gauge potentials 
for the neutrino velocity mode are indeed non-singular at early times (Bucher et al., 2000). In 
fact, even though the Bardeen potential diverge, the gauge invariant curvature perturbation 
C vanishes to leading order. This is most easily seen by making use of Eq. (2.32d, page 29), 
finding 



c 



1 



> + <£>) = o, 



(3.100) 



and thus the velocity mode is indeed an isocurvature mode. 

The leading order corrections to S mi = induced by the neutrino modes can be obtained 
as particular solutions to Eq. (3.69, page 57), which for early times reads 



2 d 



_d_ 

da m7 



_d_ 

da " 7 



k 



HJ 3(1 + r„) 



S, 



1/7 ■ 



(3.101) 



Summarizing, the early time solutions for neutrino entropy (Nd / 0) and neutrino isocurva- 
ture velocity (N v ^ 0) initial conditions are: 



Neutrino entropy 



S, 



1/7 



D 



k 



r v N, 



U) 9(1 +r„) 
2 



d oca 2 



S, 



W17 



k 



r v N d 



U) 12(1 + r„) 



oc a 



1 k 

kV = -— * oc a 
2H 

k- 2 



k z U 



r v N* d 



U) 3(7^ + 5) 



oc a 



6(1 +r„) 



const 



* = $ 1 



2r h 



l + r u 



7r u + 5 
1 r u 
4 6(7r„ + 5) 



const 



Neutrino velocity 



S, 



N v a 



D 



3(1 + ^(9^ + 5) 



S 



m-y 



aN v 



kV = — f 

n 



k 2 U„ 



const 



8N„, 



const ( = . 



9r„ + 5 
4r„JV« 



ft 

(1 + ^(9^ + 5) \k 



(3.102) 



— a oc a 
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3.3.4 The divergent nature of the anisotropic stress mode 

One could ask whether it would be possible to excite a growing "neutrino anisotropic stress 
mode", characterized by initial conditions D = S V1 = S mi = V v -y = V mi = and U u / 
for a — ► 0. Even though highly exotic, such a mode, if it existed, should be included if we 
want to consider the most general type of perturbations. We now show that this mode is 
divergent in all gauges, and therefore is non-physical, since it would lead to the breakdown 
of perturbation theory for a — > 0. Alternatively, we can see it as a decaying mode, which 
therefore does not need to be considered since it quickly disappears. 

Consider the anisotropic stress equation (2.51, page 32) with U u = Hq = const on the 
right hand side, 

# = $ r J- -H 2 n . (3.103) 

(l + r v )(l + a) V ' 

Since TL = r]^ 1 to leading order for a <C a eq , it follows that if oc r/~ 2 . The fact that the 
Bardeen potential diverges at early times is not by itself sufficient to discard the corresponding 
mode, as we have seen in the example of the neutrino velocity mode. A necessary condition, 
however, is the existence of a gauge in which all of the perturbation variables constructed 
out of A, B,C, E,5,v,ttl are non- diver gent. For the neutrino velocity mode, this gauge is 
the synchronous gauge. Clearly, since * is a gauge invariant variable, by construction it 
does not change under a gauge transformation but the variables A,B,C,E do, according to 
the transformation laws (2.18, page 26). If we expand in a Laurent series around r\ = the 
definition of Vl/, Eq. (2.25b, page 28), and we allow terms rf 1 with exponent n > —2, because 
of 7i = 1/rj we obtain to leading order 

A = ^ckt]- 2 . (3.104) 

In other words, in the radiation dominated universe a metric perturbation of the form A cx 
7]~ 2 is gauge invariant. This can also be seen directly from the transformation law for A, 
Eq. (2.18a, page 26): the part TLT+T does not contain terms oc rj~ 2 if T is written as a Laurent 
series in r\. We conclude that Ilo / induces a divergence of A for early times, which does not 
disappear in any gauge. One could conceive to combine A with other diverging variables to 
construct via cancellation a non-diverging metric variable: this however would unavoidably 
produce divergent terms in the matter variables. Therefore a neutrino anisotropic stress mode 
is always decaying in all gauges. 

In principle, there is a whole hierarchy of modes coming from setting Q v ^ for i > 3 
as initial conditions in the neutrino Boltzmann hierarchy. As we noticed in § 2.2.4, higher 
order moments are coupled to the potentials and to the velocity and density perturbations by 
successive powers of kr\. By reversing the argument, we see that 0^_! = O (Q^/krj) implies 
that in the early Universe and on super-horizon scales, krj 1, choosing = 0(1) for I > 3 
would produce divergent behavior in the lower-order multipoles of the hierarchy. Since for 
£ > 2 the multipole moments are gauge invariant, it follows that there is no gauge in which 
such a mode is growing. In summary, the adiabatic and the general isocurvature modes 
presented above constitute the most general type of perturbation. 
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3.4 The role of baryons 

In this section, we go back to the model of a Universe containing dark matter and photons, 
and refine the treatment given in § 3.2 by taking into account the role of baryons in the 
dynamic of the oscillations. For simplicity, we neglect the corrections induced by the neutrinos 
anisotropic stress, omitting neutrinos entirely 

Before recombination photons interact with electrons via Thomson scattering (see section 
2.2.5). The time-scale for the scattering process is set by the Compton scattering time f , 
which represents the typical time between two collisions. Tight coupling is an expansion in 
powers of f , assuming that the scattering rate is rapid enough to equilibrate changes in the 
photon-baryons fluid, and in this limit moments t > 2 in the photon distribution function are 
suppressed by successive powers of f -1 . Therefore to lowest order the photon distribution 
function is described by its zeroth and first multipoles only, and we can set II 7 = 0^>3 = 0, 
which justifies the approximation taken in the previous section. Therefore the truncated 
Boltzmann hierarchy (2.127, page 42) gives for photons 

t) g>1 + \k 2 V = 0, (3.105) 

(J 

% - -D gn - 2$ = -acT T n e (U 7 - V b ) . (3.106) 

To ensure conservation of the total momentum, we need to supplement the conservation 
equation for baryons with the Thomson drag force term coming from the scattering process, 
obtained as the first moment of the collision term 

F?™z = aa T n ePl J ^-n,C [/] . (3.107) 

The momentum conservation for baryons, Eq. (2.60, page 33), therefore gives 

D g , b + k 2 V b = (3.108) 
1 

R 

and we have defined R = 3p&/(4/9 7 ), which can easily be estimated 



V b + HV b -$ = --aa T n e (V b - V y ) , (3.109) 



R„ (**)(*!?.). (3.110) 

\i + zj Vo.022 J v ' 

The set of Eqs. (3.105-3.106) and (3.108-3.109) describes the evolution of perturbations 
for the tight-coupled photon-baryon fluid, while the dark matter component enters via its 
influence on the gravitational potential To lowest order in 1/f , collisions force the baryons 
and photons velocities to coincide, Vy = Vb, which via Eq. (3.22, page 50) implies Sby = 0, 
hence the entropy per baryon is conserved. 

Equations (3.105, 3.106 and 3.109) can now be combined into the equation of a damped, 
forced harmonic oscillator: 



A r 

d.77 



k 2 4 



(l + R)D g „ +—D gn = --(2 + R)k z <f>. (3.111) 



By comparing with Eq. (3.42, page 53), we see that baryons have two effects: they change 
the effective mass of the system (factor (1 + R) on the left hand side) and they displace 
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the zero point of the oscillation by adding to the potential Both modifications are a 
consequence of the fact that baryon add to the mass of the system but not to the restoring 
pressure, which is still given by the photons alone. 

The time dependence of R is of the order of the Hubble time, hence large compared 
to the time scale of one oscillation. For illustrative purpose, we can then neglect the time 
dependence of R and obtain from Eq. (3.111) 

D g „ + c 2 k 2 D gn = -4(2 + R)c 2 k 2 $ , (3.112) 

where the sound speed of the coupled fluid is c 2 = 1/(3(1 + R)). At early times, c 2 — ► 1/3, 
as appropriate for radiation, while at late times c 2 ~ 0, when the universe is dominated by 
matter. The homogeneous solution is still a superposition of sine and cosine oscillations, but 
adding the baryons slows down the period by decreasing c 2 with respect to the pure photons 
fluid. This is responsible for a shift in the acoustic peak positions and for a larger distance 
between the peaks in the CMB power spectrum, see the explanations regarding the role of 
the shift parameter on page 89. 

The adiabatic solution (3.56) becomes 



kV 1 



4 
3 

i+i?y /2 



D gn = -(1 + R)$ cos{c s krj) - 4(2 + R)$ , (3.113) 



— J <5>sm(c s kr]) . (3.114) 



The amplitude of the cosine oscillation has increased by a factor (1 + R), and the potential 
well has deepened by an extra factor (1 + R/2). This displacement of the zero point of the 
oscillations induces a boost (decrease) of the odd (even) peaks in the power spectrum some- 
times denotes as "baryon driving", which is discussed in § 4.1.2.2 and shown in Fig. 4.6 on 
page 91. Finally, the amplitude of the velocity oscillation becomes smaller, since it is sup- 
pressed by a factor c s with respect to the density and c s is smaller in the presence of baryons. 
This leads to a suppression of the Doppler contribution to the acoustic peak structure. From 
Eq. (3.110) we obtain that at the moment of decoupling, Zdec ~ 1100, we have R « 0.6. 

The solution to (3.111) for time-dependent R can be found in the WKB approximation (Hu 
& Sugiyama, 1995a), in which case the qualitative picture sketched above slightly changes: 
the sound speed becomes k j c s dr/, while the amplitude of the oscillations grows in time as 
cj 2 ■ This can be seen simply by considering the quantity mu>A 2 , which for an harmonic 
oscillator is an adiabatic invariant: since in our case the effective mass m = (1 + i?) 1//2 
decreases in time, it follows that the amplitude A ~ (1 + i?) _1//4 ~ cj 2 . 



3.5 Damping 

In the above discussion, we have neglected the fact that recombination takes a finite time 
to complete, and the acoustic oscillations are not frozen instantly. This "finite thickness" of 
the last scattering surface has a twofold effect: photon diffusion and cancellation. Diffusion 
damping arises because of the imperfect coupling between photons and baryons, so that 
photons diffuse out of over-dense into under-dense regions and erase fine scale anisotropies; 
cancellation occurs for scales which have the time to oscillate through recombination, so 
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that the effect of photons that last scattered on a crest of the oscillation is cancelled by 
the contribution of the photons coming from a trough. Cancellation produces a power law 
damping of the fluctuations (Hu & Sugiyama, 1995a), while diffusion damping is exponential 
and is by far the dominant effect, and the one to which we now turn our attention. It is often 
referred to as "Silk damping" (Silk, 1968). 

In view of obtaining a dispersion relation u(k) for photons accurate to first order in f _1 , 
we look for solutions of the form V 1 oc expz J uj6j]. At this order we need to include the 
photon anisotropic stress, which to first order in f _1 from Eq. (2.127c, page 42) is given by 
(neglecting polarization effects) 

16 

n 7 =f" 1 — Vy. (3.115) 

9 

Using the anisotropic stress equation (2.51, page 32) we can substitute in the dipole 
equation (2.127b, page 42) $ = ^ + Tt 2 Ily. However, we assume that the oscillation time 
scale is much shorter that the expansion time scale, i.e. lu^ 1 <C Ji~ , so that we can neglect 
the term W 2 I1 7 in the photon dipole. By the same token, in the following we also neglect all 
time dependencies of the potentials and of R compared with the oscillation time scale. 

We now expand the baryon momentum conservation equation (3.109) up to second order 
in t , and find, under the above assumptions 

V b = V 1 - i^RiituV^ - $) - f- 2 {Ruj) V 7 + 0(f ~ 3 ) • (3.116) 



Inserting this into Eq. (2.127b, page 42) we obtain 



tw(l + R)Yy = -D gn + (2 + R)$ - f"V 7 



v ' 27 



(3.117) 



To lowest order in f _1 we have found in § 3.4 that the quantity jD gn +(2+R)& oscillates with 
the same frequency as Vy, see Eq. (3.112). Therefore we set \D gn + (2 + R)& oc expz f u>dr], 
and using the photon monopole equation (3.105) we arrive at 



3{1 + R) 



+ IT 



l + R 



R 2 cu 2 + -k 2 



(3.118) 



To zeroth order we find as before lo 2 = A; 2 /[3(l + R)], which we can use to obtain the first 
order solution 



k 



+ IT 



k 



vWTRj 6(1 + R) 



R 2 



(l + R) + 9 



(3.119) 



The imaginary term in the frequency induces an exponential damping of the oscillatory 
solutions of the form exp( — k 2 /^), with the characteristic damping scale given by 



1 

6? 



R 2 



+ 



8 



(1 + tf) 2 9(1 + R) 



dr] , 



(3.120) 



Including polarization effects via Eqs. (2.132, page 42) and (2.136, page 42) would increase 
the damping, by changing the numerical factor 8/9 in the above equation to 16/15. 
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3.6.1 Temperature fluctuations 

We now calculate the fluctuations in the CMB photon temperature on the sky. When the 
photon mean free path becomes larger than the horizon scale, 1/f S> 1/H, the Universe 
becomes transparent and photons propagate along null geodesies (free streaming regime). 

In this section we calculate the photon temperature today with the line of sight method: 
we formally integrate the Boltzmann equation along the photon path, and obtain the tem- 
perature measured today as an integral over a time dependent source term. This approach 
includes in principle all the effects due to imperfect photons-electrons coupling and reion- 
ization as well, and it is the core of the fast numerical algorithms for the integration of the 
photon Boltzmann equation, such as CMBfast (Seljak & Zaldarriaga, 1996). Another deriva- 
tion of the same result based on a more physical understanding of the free streaming regime 
can be found in Durrer (1990). 

Consider the collisional Boltzmann equation for the photons temperature @(rj, k,fj,= k-n) 
(were we neglect polarization) 



+ ikfiQ + ikpL^ + $) = -f 



e + tfikv b - e - -^ 2 e 2 



and denote with 



r(r?) 



'/o 



fdfj 



the total opacity from the time rj until today. Using the equality 

d 



e ikfj,ri e ~T 



+ iknQ + 1 



we obtain from (3.121 
rno 
Jo 







e ik\i{r\-r) ) e - 



T ( tfiWb - ©0 - -^202 ) + »fyt(* + $ ) 



(3.121) 



(3.122) 



(3.123) 



(3.124) 



The second term on the right hand side can be integrated by parts and we drop the 
boundary term, which contributes only to the monopole and is thus unobservable, obtaining 



0(770, fc,//) 



'/(i 



dr]e 



tkfj,(ri-rio) 



g(rj) [-tfikV b + 0o + iP 2 02 + * + $] 

to 



+ / dr]e 
10 



tkfj,(r)-Vo) 



(3.125) 



e - r (^ + $), 



and we have defined the visibility function 

g(rj) = f e~ T 



(3.126) 



Equation (3.125) is an integral system of equations, since moments t < 3 of the photons 
temperature appear on both sides. However, the left hand side can be determined given the 
time evolution of an handful of quantities which act as a source on the right hand side: the 
photons moments £ < 3 are calculated from the Boltzmann hierarchy (2.126, page 41), the 
baryon and CDM velocity and density perturbation from the fluid conservation equations 
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(2.62-2.63, page 33), while the Bardeen potentials follow from the Poisson equation (2.49) 
and either the constraint equation (2.50) or the anisotropic stress equation (2.51, page 32). 
Neutrinos can be included via a collisionless Boltzmann hierarchy, Eq. (2.90, page 36). The 
great advantage is that only the first few moments of the collisional Boltzmann hierarchy for 
photons need to be computed accurately in order to obtain the sources of (3.125), reducing 
the number of coupled differential equations which needs solving from several thousands to a 
few dozens. This line of sight integration approach is the core algorithm of all modern codes 
for the computation of the CMB power spectrum (Seljak & Zaldarriaga, 1996). 

The visibility function g(r])dr] in (3.125) encodes the information regarding the ionization 
history of the Universe, and can be interpreted as the probability that a given CMB photon 
was last scattered between r\ and r] + dry. The sharp drop of the free electron density n e 
at decoupling makes the visibility function sharply peaked around rj c \ ec , cf. the solid line 
in Fig. 6.15. When the Universe is reionized at later time, the visibility function becomes 
non-zero again, and the free streaming regime goes once again over in a collisional regime 
(§ 4.1.3.2). 

In the limit of instantaneous recombination, the LSS becomes infinitely thin and the 
visibility function a delta function peaked at r/dec, while we can approximate e~ T with the 
Heaviside step function u{rj — r/dec)- In this limit, the tight coupled fluid approximation 
for photons goes over directly to the free streaming regime, and there is no generation of 
photons anisotropic stress nor polarization. Performing the time integral of (3.125) and 
setting to zeroth order V& 



V-y we find 



G(?7o, k, (j) = r e (osw) + @(d p i) + @(isw) 



(3.127) 



where 
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(3.129) 
(3.130) 
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The temperature fluctuation consists of three terms: 

• The ordinary Sachs- Wolfe (OSW) part, 0( osw ) . The photons temperature monopole 
6o on the last scattering surface, together with the potential terms $ and reflect 
intrinsic inhomogeneities in the radiation fluid and in the metric at the moment of de- 
coupling. On large scales, the ordinary SW effect is responsible for the S W plateau in 
the temperature power spectrum, while on intermediate scales the oscillations of D ga 
produce the familiar peak structure. 

• The Doppler term 0( Dpl ) oc kVb arises because of the relative velocity of observer 
and emitter. Its contribution shows up on the acoustic peak scale. 

• The integrated Sachs- Wolfe (ISW) effect produces the term @( ISW ), and it is 
induced by a time dependence of the Bardeen potentials along the path of the photons. 
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The early ISW effect is due to the fact the the universe is not completely matter 
dominated at recombination and therefore the potentials are not exactly constant; the 
late ISW is generated when the late universe becomes dominated by the curvature or a 
cosmological constant term, both of which induce a time dependence in the potentials. 

The dependence of the anisotropies on the cosmological parameters is presented in § 4.1. 
3.6.2 Angular power spectra 

The relevant quantities for the comparison of theoretical models and observations are the 
temperature and polarization angular power spectra, which we introduce in this section. We 
refer the reader to § 5.1.1 for precise definitions of the terminology. We denote by (•) the 
theoretical ensemble average over realizations. 

Temperature power spectrum 

The temperature fluctuation in direction n on the sky measured by an observer today (770) 
and here (xo) is a superposition of plane wave contributions (in a flat Universe) 

9(7/0, xo, n) = — f d 3 fc 8(770, k, n)e lx ° k (3.131) 

and each Fourier mode can be expanded in spherical harmonics on the 2-sphere as 

00 1 

6(770, k, n) = ^ ^2 <%n(k, 77o)y^ m (n) , (3.132) 

1=0 m=-t 

where the expansion coefficients a^ m (k) are given by 

a £m (k) = J <m n Q(k,n)Y im (n) (3.133) 

= 47r0 < (k)Y An (k) . (3.134) 

In deriving the last expression we have expanded the temperature fluctuation in Legendre 
polynomials as in (2.84, page 36) and used the addition theorem and orthogonality relation 
for spherical harmonics: 

JL of _i_ 1 

£ lW(n)Y/ m (n') = -^-Pl(n ■ n') , (3.135) 

n=-t 

d^ n Y m (n)y £ T m ,(n) = Sw8mm> ■ (3.136) 



/ 



We can perform the harmonic expansion (3.132) directly in real space rather than in Fourier 
space, with coefficients a£ m (xo) (for which we will neglect the argument xo from now on), 
obviously related to a^ m (k) by 



f d 3 k^ m (k)e lkx ° . (3.137) 



tm (2vr)3/2 

We are interested in the 2-point temperature correlation function C on the sky between 
two directions n and n'. By choosing our coordinate system in such a way that the direction 
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n corresponds to the z-axis, and introducing spherical coordinates we can write n' = (0,1?) 
and n • n' = cos(i?). If we assume statistical homogeneity and isotropy for the random 
field 0, see § 5.1.1, then the correlation function does not depend on the observer's position 
(homogeneity) nor on the azimutal angle (f> (isotropy). Therefore 

= f £(2^+lX^(n- n '), (3 ' 138) 

t 

where we have defined the CMB angular power spectrum by 

(a£ m ■ a £'m') = fiwdmm'Ce . (3.139) 

The fact that Cg does not depend on x is a consequence of the assumption of homogeneity, 
while isotropy requires that it does not depend on the index m, which would introduce an 
azimutal dependence. It is also customary to assume that the a^m's are Gaussian random 
fields, as motivated by inflation, but this is not strictly necessary at this stage. Eq. (3.138) 
shows that the angular power spectrum is the harmonic transform of the correlation function 
on the 2-sphere and for Gaussian variables it contains the full statistical information. If the 
aim's are Gaussian distributed, then the Fourier coefficients d£ m (k) are Gaussian random 
variables as well. From the assumption of homogeneity it follows that (a^ m (k)) = 5 (D) (k), 
where 5 (D) denotes the Dirac delta function. Homogeneity and isotropy together imply that 

(km|2) = WfS ^ ahn ^ • ( 3 - 14 °) 

We now relate the angular power spectrum to the temperature multipoles: this is done 
by observing that the evolution equations (2.126, page 41) for Qg are independent of k, and 
therefore we can write 

e*(77,k)=e*(77,A0x(k), (3.141) 

where we assume that x(k) are the Fourier components of a Gaussian, isotropic and homo- 
geneous random field. As a consequence 

( X (k) • X *(k')> = $™(k - k') (| X (A;)| 2 ) . (3.142) 

Now from (3.139) and using Eqs. (3.141), (3.140) and (3.134) we obtain 

/dk 
— P x (k)\@ e (r, ,k)\ 2 . (3.143) 

We shall later identify x with the primordial curvature or entropy perturbation, see Eq. (4.5, 
page 79), and call 

P x(k) = ^(\X\ 2 ) (3.144) 

the curvature (or entropy) power spectrum: this quantity gives the contribution to Cg per 
logarithmic fc-interval of the primordial fluctuation. 

The photons transfer function 0^(?7o, k) in Eq. (3.143) above is an intrinsically 2-dimensional 
quantity which gives information about how the initial power is mapped onto the angular 
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power spectrum. It can be evaluated from Eq. (3.125, page 67), by observing that the angle 
fx = k • n in the integrand can be eliminated by replacing 

d 

dry 



o lk Kv-vo) 



AKv-vo 



gv b 



O ikii{r)-rio\ 



gv b 



g tkfi{ri-r]o) 



gVb 



(3.145) 



and dropping the total derivative which only gives an unobservable monopole term. Therefore 
we can rewrite (3.125, page 67) as 

9(7/0, k, n) = [ V0 dr ] e tk ^ r >-vo) S ( r]i fc ) ( 3 146 ) 

Jo 

with the source term of the form 



^ + e -^ + ^ + $ 
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(3.147) 



+ e~ r (^ + $). 

Now we expand the plane wave in radial and angular eigenfunctions, Bessel functions and 
Legendre polynomials respectively, using the Rayleigh formula 

tk^i(ri-T)o) 



Y / i i (2£ + l)j £ (k( Vo - V ))P £ ( f j,), 



(3.148) 



and we obtain for the temperature transfer function 

@i(Vo,k) 



dr]S(rj,k)j e (k(r] - rj)) 



(3.149) 



This is shown in the top panels of Fig. 3.1 for adiabatic and isocurvature CDM initial condi- 
tions. 

Together, Eqs. (3.149) and (3.143) allow the computation of the CMB angular power spec- 
trum and neatly split the geometric effects from the physics: all of the dynamical evolution is 
encoded in the source function S(rj, k), while the Bessel function accounts for the projection 
from 3-dimensional /c-space on the 2-sphere. The generalization of this result for the K. ^ 
case can be found in Zaldarriaga et al. (1998); Zaldarriaga h Seljak (2000); Lewis et al. 
(2000). The temperature and E-polarization spectra of a concordance model for adiabatic 
and isocurvature CDM initial conditions are displayed in the top left panel of Fig. 4.9 on 
page 94. 



Polarization power spectrum 

As mentioned in § 2.2.5.5, polarization of scalar modes is conveniently described by the E 
polarization mode, supplemented by the cross-correlator between E and T (temperature). As 
for temperature, we can formally integrate the Boltzmann equation for the Stokes parameter 
Q, Eq. (2.131, page 42), along the line of sight and obtain 

9^0, k, M ) =~l£° e tk ^-vo) g{r]) (1 _ p 2 ) (@ 2 + @ Q_ . (3 . 150) 

The E-polarization power spectrum and the ET-correlator (superscript C) are defined as 

(af m ■ a*(F m ,) = 5e£'S mm 'Cf , (3.151) 

( a Jm " a l'm') = fiu'dmm'C't , (3.152) 
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Figure 3.1: Temperature (top) and polarization (bottom) transfer function 67(770, fc) and 
Af(r]o,k) for adiabatic (left panels) and isocurvature CDM (right panels) initial conditions. 
The color scales are arbitrary, and have been chose as too highlight the features of the transfer 
functions. In particular, the color coding is not in scale between the different plots. 



and in analogy with the treatment for the temperature spectrum they can be computed as a 
superposition of k modes of a source function integrated over time: 

Cf = ^Jy P x( fc ) l A f (vo, k)\ 2 , (3.153) 

Af fa>, k) = ^1±^J^° d V S E ( V , k)j e (k( V0 - r,)) , (3.154) 

The cross-correlator spectrum is computed using (3.149) as 

Q c = 4vr j ^P x (k)Q* e (rio,k)Af(jio,k). (3.156) 

The polarization transfer function Af(r)o,k) is plotted in Fig. 3.1 for adiabatic and isocur- 
vature CDM initial conditions. 
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The degree of polarization is proportional to the magnitude of the temperature quadrupole 
at last scattering. Since during the tight coupling regime the temperature quadrupole cannot 
grow, polarization is generated in the relatively short transition between the strong coupling 
and the free streaming regime. To first order in f -1 , the temperature quadrupole is pro- 
portional to the temperature dipole, see (4.30, page 83). The polarization amplitude is thus 
proportional to the temperature dipole at recombination times the width of the last scat- 
tering surface (Zaldarriaga & Harari, 1995), resulting in a polarization signal two orders of 
magnitude lower than the temperature signal. 

3.6.3 Matter power spectrum 

Let 6(rj, x) denote the real-space density contrast in the matter component in the comoving 
gauge; hence 6 corresponds to the gauge invariant variable A m defined in Eq. (2.37, page 30). 
We will drop the time dependence when not needed, and write 6 instead of A m to simplify 
the notation. For clarity, the Fourier transform of the variables is denoted by a subscript "k", 
in this section only. 

The real space correlation function is defined as 

£(r) = (5(x)-S(x + r)), (3.157) 

where (•) denotes an average over realizations, see § 5.1.1 for precise definitions. It is the 
expectation value of 62 = <5(x + r) and 61 = <5(x) under the 2-point probability distribution 
function for 61,62- We write 5(x) as 

*(x) = ^ / d 3 k^ k * (3.158) 

where we denote by 6 k the Fourier transform (in flat space) of 5(x). We postulate that <5(x) 
is a Gaussian distributed, isotropic and homogeneous random field, see § 5.1.1, and therefore 
the quantity (<5£ -<5k') vanishes for k ^ k' (homogeneity) and it only depends on the modulus, 
not the direction of k (isotropy) : 

(6* k • 6 k ,) = (27rf/ 2 6^(k - k') P m (k) (3.159) 

where <5 (D) denotes the Dirac delta function. We call P m {k) the matter power spectrum. 
Replacing (3.158) in (3.157) we obtain 

CM = — ^ / d 3 kP m (k)e^ = J dkk^P m (k) , (3.160) 

showing that the correlation function is the Fourier transform of the matter power spectrum. 

Our aim is to compute the power spectrum today as a function of the spectral distribution 
in the early Universe in the adiabatic CDM scenario. To this end, we make use of the results 
of linear perturbation theory presented in the previous sections for the growth of matter 
perturbations in a Universe containing CDM and photons only. Clearly, these computations 
are valid only as long as the scale considered is in the linear regime, i.e. 6 k <C 1. We only 
sketch the elements which are needed in the following, referring the reader to e.g. Peebles 
(1980); Padmanabhan (1993); Liddle & Lyth (2000) for a full account. 
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Perturbations Sk over a comoving length A ~ k^ 1 behave differently depending whether 
they are outside (k < TL) or inside (k > TL) the Hubble length. For a given scale k, we 
denote by rj ent the time at which that scale crosses inside the horizon, i.e. TL{rj ent ) = k and 
by k eq the wavelength which enters the horizon at the time of matter-radiation equality, i.e. 
k e<l = TL{j] eq ). We thus need to distinguish two cases: scales k > k eq enter the horizon in the 
radiation dominated epoch, while k < k eq enter the horizon after matter domination. We 
shall restrict ourselves to length scales A which are large enough not to be wiped out by free 
streaming, i.e. A > Aps, see Padmanabhan (1993) for details. 

For k > /c eq and r/ ent < rj < ry eq , 5k(rj) stays approximately constant after horizon crossing 
because the radiation dominated epoch suppresses the growth of perturbation in a dust-like 
component; this is called the Meszaros effect (Meszaros, 1974). For rj > rj eq the Universe is 
matter dominated and the situation is analogous to the single fluid case examined in § 3.1, 
and the perturbation grows as 5^ oc a, see Eq. (3.14, page 49). Wavelengths which enter 
the horizon in the matter dominated epoch, k < & eq , start growing as soon as they cross the 
horizon, S^rj) oc a for rj > ry ent , by the same argument given above. Summarizing, we have 
that 



^k(?7 > Vent) oc < 



(Sklent)— for k > fc eq 

a? a . ( 3 - 161 ) 

£k(f?ent)— — — for k < k eq 



and therefore we know <5k for all subsequent times once we specify 6u_(r] ent ), the value of the 
density contrast for the wavelength k at the moment when that wavelength crossed inside the 
horizon. Since for a given wavelength rj ent oc 1/k, horizon crossing happens at a different time 
for each scale. We notice that in the second line of Eq. (3.161) we can rewrite the factor 

^eq/^ent 

\ 2 / T \ 2 
Veq \ I \ („2 



— otk z , (3.162) 

Vent / V. "'eq / 

where in the first equality we have used the fact that a oc rj 2 in the matter dominated universe. 

Given that the range of scales of cosmological interest is not too wide, we can make the 
following power law Ansatz for the scale dependence of the perturbation at horizon crossing 

S k (Vent) = Ak~ a . (3.163) 

An important quantity is fc 3 /(27r) 3//2 P m (/c), which from (3.160) gives the contribution per 
logarithmic A:-interval to the real space correlation function, and which with the above Ansatz 
evaluates to 

A; 3 



(2vr) 3 /2- 



P m (k) oc k 3 ~ 2a = const for a = 3/2 . (3.164) 



Vent 



This quantity can also be interpreted as the variance of the mass contained in spheres of 
diameter A ~ 1/k at horizon crossing, see e.g. Padmanabhan (1993); for the value a = 3/2 
the variance is the same on all scales. 

We might prefer to specify our Ansatz not at horizon crossing, but rather for some fixed 
initial time (the same for all scales) rfc. In order to relate 5k(^«) with 6k(i] Bnt ), we notice 
that on super-horizon scales k < TL and for times r/ ent > r\ > r] eq we have 5^ oc a oc rj 1 from 
Eq. (3.14, page 49). For the case k < TL in the radiation epoch, rj < r) ent < rj eq we can use the 
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adiabatic solution (3.26, page 51) and the relation 

s * = Am = + i^r s ~ i D + s a fl2 K ^ - ( 3 - 165 ) 

4 + da 4 + 3a 4 

and the approximation is valid for a < a eq . In conclusion, the comoving dark matter density 

contrast grows as rf at all epochs while outside the horizon. Therefore we obtain (with 
^ent > Vi f° r an scales of interest) 

tfkfent) = ( ~ ) foiVi) « ^~ 2( ^k(??i) • (3.166) 



It is customary to make a power law Ansatz for the matter power spectrum at the time rji of 
the form 

P m {k, m ) = Bk n (3.167) 
and by the relation (3.166) the index n is related to a by 

n = -2a + 4. (3.168) 

The value a = 3/2 which yields a constant-mass- variance on all scales at horizon crossing 
corresponds to n = 1, the so-called "scale invariant spectral index", also known as Harrison- 
Zel'dovich spectrum (Harrison, 1970; Zel'dovich, 1972). The power spectrum today then 
becomes in terms of n, from (3.161) 

f k n ~ 4 for k > k eq 
<5kM oc { . (3.169) 

[ k n for k < k eq 

The length scale which crosses the horizon at equality, A eq « 13/(f2 m /i 2 ) Mpc corresponds 
to a peak in the power spectrum: fluctuations on larger scales, k < fc eq ~ 1/A eq retain their 
primordial shape, while perturbations on smaller scales have their spectrum multiplied by k~ 4 . 
The above arguments only apply in the linear region, i.e. for k<0.3 n/Mpc, above which 
non-linear growth of the fluctuations invalidate perturbation theory and a full numerical 
simulation is required to follow the evolution. 

Finally, we can easily relate the matter power spectrum to the Bardeen potential by 
using the Poisson equation (2.49, page 32). If we consider the value of ^'k(^ent) ) the Fourier 
transform of ^ evaluated at horizon crossing, we have from the Poisson equation, noticing that 
W(r/ ent ) = 1/k, 5k = A m ~ A 7 ~ D by the adiabaticity condition, that ^k(r] ent ) ~ — 5k(rj ent ). 
Therefore for the power spectrum of the Bardeen potential, defined as 

P*^(l*k| 2 > (3.170) 

we have that 

P*(k)\ Vmt oc k z P m {k)\^ oc k n ~ x , (3.171) 

and the n = 1 scale invariant spectrum corresponds to P^(r] ent ) = const. Or we can specify 
Pq, at a fixed initial time rji, in which case we obtain again from the Poisson equation 

P*(k)\ vi oc k~ l P m {k)\ m oc k^ 1 . (3.172) 
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The fact that there is no evolution in the power spectrum of until horizon crossing is 
of course a consequence of the fact that « const on super-horizon scales, as shown in 
§ 3.2. The same scaling applies for the power spectrum of the gauge invariant curvature 
perturbation (, which is constant on super-horizon scales for adiabatic perturbations, and 
proportional to ^f. 



Chapter 4 

Parameter dependence 



This chapter presents a brief review of the dependence of the CMB power spectra on the 
standard cosmological parameters and on general initial conditions, building on the results of 
the previous sections. Understanding the impact of the parameters on the observable spectra 
builds the framework for parameter extraction from data, which is the subject of Part III. 

In § 4.1 we concisely review the origin and main parameters dependencies of well known 
features of the power spectrum: the large scale Sachs- Wolfe plateau, the acoustic oscillations, 
and the damping tail. Introductory reviews on this topic can be found in e.g. Kosowsky 
(2002) and Hu et al. (1997). A detailed physical understanding in a fully analytical approach 
is explained in Hu & Sugiyama (1995a, b, 1996). In view of efficient and accurate parameter 
estimation, fundamental degeneracies in the CMB spectra are best understood by introducing 
a set of analytical functions of the parameters which the CMB probes directly, and upon which 
the spectra dependence is almost linear (Kosowsky et al., 2002). We call this new basis in 
parameter space "normal parameters set", and we illustrate it in § 4.2. 

In § 4.3 the CMB angular power spectra for general isocurvature initial conditions in a 
Universe containing CDM, baryons, photons and neutrinos are presented. The four modes 
adiabatic, CDM isocurvature, neutrino density and neutrino velocity - along with a baryon 
isocurvature mode which is equal to the CDM mode up to a rescaling constant - span the 
whole space of non-diverging solutions of Einstein's equations at early times (Bucher et al., 
2000), and thus their superposition constitutes the most general type of initial conditions for 
CMB anisotropy. 

4.1 Standard parameters 

The detailed shape of the CMB temperature and polarization spectra depends on the value 
of the cosmological parameters and on the type of initial conditions in characteristic ways. 
However, certain combination of parameters lead to very similar spectra: this causes degen- 
eracies among some parameters, which cannot be reconstructed with CMB alone, but require 
the inclusion of external data-sets. 

Polarization information helps breaking temperature degeneracies because of two charac- 
teristic features: the first is that after decoupling the polarization state is preserved by free 
streaming, and the polarization spectrum is only modified by rescattering due to reionization 
(§ 4.1.3.2). Therefore in a sense polarization is a more clean probe of the decoupling than 
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temperature. The second reason is that while the acoustic peaks in temperature are domi- 
nated by the monopole of the temperature fluctuation on the LSS, the peaks in E-polarization 
reflect the dipole component at decoupling, i.e. the photon bulk velocity (§ 4.1.2.1). 

In the following we revisit the main parameter dependence of the CMB spectra: for the 
sake of illustrating the physical effects involved, we divide the CMB power spectrum in 
three distinct regions, corresponding to different angular separations on the sky with the 
approximate relation •& ~ ir/£. 

• Large scales: on scales larger than the Hubble radius at decoupling, kr]d ec 1, per- 
turbations are dominated by the ordinary Sachs- Wolfe effect, given by the combination 
of the intrinsic temperature fluctuations on the LSS and the gravitational redshift in- 
duced by climbing out of the potential well. In non-flat cosmologies, or models with 
a considerable value of the cosmological constant, the late IS W effect also contributes. 
This region corresponds roughly to the COBE scale, ^30 and 7°. 

Reionization produces a a characteristic increase of E-polarization on large scales, the 
so-called "polarization bump". 

• Acoustic region: inside the sound horizon photon pressure cannot be neglected, 
and scales within the sound horizon k J c s rj > 1 oscillate, while gravitational infall be- 
comes negligible because of potential decay inside the horizon. On intermediate scales 
50 <, I <> 600 the CMB power spectrum displays a rich peak structure, reflecting the 
contributions of density oscillations and Doppler term on the LSS. The early IS W ef- 
fect contributes at roughly the 20% level up to the first acoustic peak (for adiabatic 
models). Those scales have a typical angular separation on the sky ranging from about 
10° down to a few 10'. 

• Damping tail: wavelengths smaller than the diffusion damping scale 1//cd given in 
(3.120, page 66) are exponentially suppressed and this causes a drop in power above 
I ~ 800 or ■&<, 1'. This effect combines with rescattering due to reionization, which also 
erases fine-scale anisotropies. 

4.1.1 Large scales 

We wish to investigate the expected temperature fluctuations on very large scales in the gen- 
eral case of a superposition of primordial adiabatic and isocurvature CDM initial conditions. 
We look at wavelength k <C &dec which at decoupling where still outside the horizon and we 
consider a zeroth order approximation which neglects any anisotropic stress and the baryon 
influence (i.e. set R = 0). If we take decoupling to happen well into matter domination, 
we can also neglect the ISW contribution since the potentials are equal and constant - see 
Eq. (3.14, page 49) - and to this level of approximation we can set = Vy. With this 
approximations we have for each Fourier mode from Eqs. (3.128, page 68) and (3.129, page 



In the adiabatic case, we can neglect the contribution of the Doppler term which behaves 
as a sine and hence disappears on large scales, fc%ec "C 1, while the cosine oscillation of 



68) 




(4.1) 



4.1 Standard parameters 
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the density perturbation D ga becomes constant, see (3.56, page 55). Therefore for adiabatic 
initial conditions, from the solution (3.56) it follows 



8(770, fc,/i) ~ e 4fc ^ec-%) 



;U M D - 2$MD 



(adiabatic), (4.2) 



where $md denotes the value of & at decoupling well within matter domination. On the 
right hand side, the term — 2<I>md comes from the solution (3.56), and its negative sign 
reflects the fact that the temperature is larger inside potential wells ($ < 0), so that photons 
are blushifted when they fall into the well. The term 2$md represents the gravitational 
redshift which photons experience when they climb out of the potential as they free stream 
after decoupling, which exactly cancels the gravitational blueshift term in the absence of 
baryons. In conclusion we have 

9(7/0, ~ e'^^^^^MD (adiabatic). (4.3) 

For isocurvature initial conditions, we have that D 5i7 (r?dec) = 0, which follows from (3.48, 
page 54) with the isocurvature condition $o = 0. The Doppler term can again be neglected 
with respect to the potential, because from (3.53, page 55) we have that kVy ~ k/TC& <C $ 
and (4.1) reduces to 

8(i) )l * l /i)»e ,w,h «" w) 2$ MD (isocurvature), (4.4) 

the well-known result that isocurvature initial conditions produce large scale fluctuations six 
times larger than in the adiabatic case for the same value of the Bardeen potential on the last 
scattering surface. 

More interestingly, we can relate the large-scale temperature fluctuations to the amplitude 
of the primordial curvature and entropy spectra. Rewriting (4.3-4.4) in terms of the curvature 
and entropy perturbations in the radiation era via Eqs. (3.33-3.36, page 53), yields for the 
source term (3.147, page 71) 



S(r],k) = 5(rj - rj dec ) 



(4.5) 



where ip(k) and <f){k) are the Fourier components of random fields which we assume are 
Gaussian distributed, isotropic and homogeneous, see § 5.1.1, evaluated at some initial time 
rji deep in the radiation epoch. For their power spectrum we make a power low Ansatz 



" ' " N |2\ _ a2 ' ^ 



^)U^<iv#r> = Co^) (4.6) 



m)\ v ^^{\m\ 2 ) = Sl(^) , (4.7) 



k 3 , „ / k ^ nc ~ 1 



P c( k )\ Vi = ■ = ^So ( ] cos(A c ) . (4.8) 

The constants Co and ^o are dimensionless and positive, while the angle A c parameterizes 
the correlation between entropy and isocurvature perturbations; the constant fcp is a pivot 
scale, for which a popular choice is kp = 0.05 Mpc -1 , and we have defined n c = (n s + n e )/2. 
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The power law index n s is the scalar spectral index: n s ~ 1 is a generic prediction of 
inflation, almost independently of the particular model, and is called "scale-invariant" or 
Harrison-Zel'dovich (Harrison, 1970; Zel'dovich, 1972) spectral index. The reason for the 
name is explained in § 3.6.3. Since ^ cx C up to constant factors, ^ and C have the same 
spectrum. 

From (3.143, page 70) the angular power spectrum on large scales (£<20) is then given 

by 



Ct =4tt 



dk 
~k 



(4.9) 



X 3i (HVO - Vdec)) ■ 

The integral can be performed analytically provided all the indexes are within the range 
— 3 < nx < 3 and in the approximation k(r]Q — ?7dec) ~ krjo (Gradshteyn 8z Ryzhik, 1965). 
The result is 



Ct = 2vr 



C z 4S 2 4 

g/(n S) £) + -^rf(n e ,£) - -( S cos(A c )f(n c ,£) 



(4.10) 



The function / contains the dependence on the spectral indexes, and it is given by 

/(»■<>= faw^ ^-f^Vi-i) - (4 - 11! 



where T is the gamma function, which for a scale invariant spectrum, n = 1, evaluates to 

f ^ =l ^ = whw)- (4 - 12) 

If both the curvature and entropy spectral indexes are close to scale invariant (n s = n e = 
1), we find that the so-called Sachs-Wolfe (SW) plateau for £^20 is constant: 



-Q = 25C0 + - - cos(A c )Co5 « 10^ 1U , (4.13) 

and the numerical value is the measurement of the DMR instrument aboard the COBE 
satellite averaged on scales < 7° (Smoot et al., 1992). Clearly, uncorrelated entropy and 
curvature perturbations (i.e. with cos(A c ) = 0) both add to the SW plateau, but a positive 
correlation (defined by cos(A c ) > 0) reduces the power on large scales, while a negative 
correlation increases it, as shown in the top left panel of Fig. 4.9 on page 94. If there is no 
correlation, the isocurvature Sachs-Wolfe plateau from (4.3) and (4.4) is 36 times larger than 
the adiabatic one for the same value of ^ at last scattering, and 4 times larger for the same 
amplitude of the primordial curvature and entropy perturbations, Eq. (4.13). In the pure 
adiabatic case, Sq = 0, we obtain from (4.13) an estimate of the primordial amplitude of the 
curvature perturbation: 

Co~5-10~ 5 . (4.14) 

For models with a non-zero cosmological constant, the Universe becomes A dominated for 
a/ao > (Om/r^A) 1 ^ 3 , and the potentials start again to decay. This produces a late time ISW 
which contributes on large scales, where it is dominant with respect to the ordinary SW part 
described above, producing a rise of the SW plateau at low multipoles. The details differ 
considerably for adiabatic and isocurvature models, and also depend on the spectral index, 
see Hu & Sugiyama (1995b) for a detailed explanation. 
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4.1.2 Acoustic region 

The structure of the power spectrum on intermediate scales is the result of several physi- 
cal effects, sometimes with contrasting impacts. The most distinctive features are acoustic 
oscillations and projection. 

4.1.2.1 Peak locations 

Scales kr s = k J^ 60 c s drj > 1 enter the horizon before decoupling and thus D gr/ oscillates as 
cos(r s k) - cf. (3.56, page 55) - for adiabatic perturbations or as sin(r s /c) - cf. (3.62, page 55) 
- in the isocurvature mode. Thus scales which at the moment of decoupling have reached 
an extremum of their oscillation will yield corresponding peaks in the temperature power 
spectrum. Notice that since the power spectrum is a quadratic quantity, both maxima and 
minima of the oscillations give peaks. The k modes which at recombination are at maximum 
compression or expansion are 



kJ^ = — -, r, m = 1,2,3,... (adiabatic), (4.15) 

r s {Vde 



, (m) _ rrnr + 1/2 ^ m = o, 1, 2, . . . (isocurvature). (4.16) 

r s (?7dec) 

The corresponding physical scale A phys = a^ec^/k subtends an angle i? on the sky given by 
the angular diameter distance relation (1.32, page 15), and the peaks in the angular power 
spectrum show up at £ ~ ir/r9 or 

6 m) ~ rrnr— (r? dec ) (adiabatic), (4.17) 

ar s 

£ {m) ~ (I + m)TT — (r] dec ) (isocurvature). (4.18) 

CLT S 

Since D gn (k = fej^ ) < 0, the first adiabatic peak corresponds to a compression maximum, 
while the first "isocurvature hump" is an expansion maximum, D ga (k = k^) > 0. In 
the literature, "first acoustic peak" usually designates the compression peaks, i.e. the first 
adiabatic extremum and the second isocurvature one, which in the notation of (4.15-4.16) 
correspond both to the index m = 1. For a flat universe (K, = 0) without cosmological 
constant = 0) and a baryon content as inferred from BBN (Qf,h 2 « 0.02), the location 
of the first acoustic peak is approximately 

~ 220 (adiabatic) and (4.19) 
£ {1) ~ 330 (isocurvature). (4.20) 

The WMAP data allow a very precise determination of the position of the first peak, 
£v) = 220.1 ± 0.8 (Page et al., 2003), thereby confirming that the adiabatic mode is the 
dominant one. However, subdominant isocurvature contributions cannot be ruled out, see 
Chapter 7. 

The location of the peaks depends on the of initial conditions, but the inter-peaks distance 
is independent on the type of perturbations, and in the above estimate is A£ w 220. The peak 
spacing depends on the baryon content, which sets r s , and on the spatial geometry which 
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enters in Da- A larger baryon content slows down the oscillations, thus decreasing the sound 
horizon and the spacing between peaks grows larger. The dependence of Da is primarily on 
the curvature of the universe: in a crude approximation we neglect fl/c <C £l m and Q\ when 
integrating (1.34, page 15) up to Zdec ~ 1100 S> 1 and neglect ft r as well (which is not a good 
approximation for a large redshift) and we obtain 

DA(z dec ) « ^ m 1/2 • (4.21) 

— 1/2 

Therefore the peak position scales as f2 m , which means that the peaks are shifted to 
larger £ values in an open universe. Introducing a non-zero cosmological constant complicates 
matters, since it is then possible to obtain the same value of the angular diameter distance, 
and hence the same peak location, by compensating a change in Q m with a different value of 
SIa, an effect which goes under the name of angular diameter distance degeneracy (Efstathiou 
&; Bond, 1999; Melchiorri & Griffiths, 2001). The angular diameter distance test is no longer 
sufficient to determine alone the curvature of the universe, but an independent measurement 
of Q m or £l\ is necessary. 

To illustrate this fundamental degeneracy, let us introduce the shift parameter 1Z shlft , 
which gives the first peak's position (in an adiabatic model) with respect to its location in a 
flat reference model with Q m = 1: 

£W =£«/ft shift , (4.22) 

which can be evaluated from (4.17). To this end, we need the explicit expression for the 
sound horizon at decoupling, which is given by 

dr? 

r s {adec) = c s —da 
Jo da 

^dec/ao (4.23) 



i + fg*-x) (n m x + n r + n K x 2 + n A x A ) 



1/2 



(where all the fix's are evaluated today). Neglecting the curvature and cosmological constant 
term in the early universe (adec/oo *C 1) yields the approximate result 



r s (a de c) ~— Yh n — x 



x In 



1 + R eq + 2i?dec + 2 V / (l + R dec )(R eq + R) 



1 + -Req + 2y -R eq 



(4.24) 



where 



(X 

R ( a ) - 77i and ^ = R{a eq ),Rdec = R(a dec ) ■ (4.25) 

In order to find a simple approximate expression for TZ shii \ let us ignore the logarithmic 
dependence on the parameters of r s , and neglect the parameter dependence of the factor 
(aeq/ao^V-Req 2 as well; we shall relax those approximations in § 4.2. Then the sound 
horizon for K, / models scales as 

r s (adec) « otJ , (4.26) 
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while for the reference model with (£l m ,£l\) = (1,0) we have 

-Cyl(adec) 



ec - 1 



2a , (4.27) 



with a being approximately the same factor as in (4.26). For the shift parameter (4.22) of a 
model with arbitrary (J7 m , fi^y) we then obtain the simple expression 



ft«*ift„_2 /M, (4.28) 

where A77 is given in Eq. (1.34, page 15) and x m Eq. (1.3, page 12). This handy expression 
gives the approximate position of the first peak as a function of Q m and Q\, with Q/c obtained 
from the constrain 1 = Q m + J7a + ^a: • Here we have ignored the dependence on the radiation 
content of the model, which is explicitly included in (Eq. (6.5, page 126)). In the left panel 
of Fig. 4.1 we plot lines of 7£ shlft = const in the (fi m >^A) plane, which are not parallel to 
lines of constant curvature (diagonal lines). 

Along with 7£ shlft , two other physical quantities determine the structure of the peaks: 
the baryon density fi^/t 2 controls the relative height of the peaks, see § 4.1.2.2, while the 
amount of matter Q m h sets the redshift of equality, for a fixed relativistic energy content. 
Therefore by fixing the three quantities 1Z shlft , Q. m h 2 , Q^h 2 we obtain models with almost 
indistinguishable power spectra in the acoustic region. This is illustrated in the middle panel 
of Fig. 4.1, where a flat, a closed and an open model result completely degenerate, with the 
only difference showing up on large scales because of the different amount of late IS W effect. 
The right panel shows that conversely the first peak's position in three flat models can be 
very different if the shift parameters differ, and therefore the statement that the first peak 
position alone can determine the curvature of the Universe is imprecise. 

Polarization peaks are displaced by ir/2 with respect to temperature peaks, hence polar- 
ization maxima occur at temperature minima. This can be seen by expanding to first order in 
f _1 the polarization hierarchy (2.132-2.135, page 42), finding for the polarization monopole 
and quadrupole 

e^ = -^e 2 and ef = -^e 2 . (4.29) 

The temperature quadrupole is found to the same order from the temperature hierarchy, 
including the polarization feedback as in (2.136, page 42), giving 

6 2 = -f" 1 — ikQi. (4.30) 
15 

The E-polarization source term (3.155, page 72) becomes in the instantaneous decoupling 
approximation 

S E = -T'Hvo - mec)~ 2 l<3>l(mec) , (4.31) 

showing that E-polarization probes the temperature dipole, i.e. the bulk velocity of the 
photons-baryons fluid, at decoupling. Since 0i oc Vy cx D gr/ we see that polarization os- 
cillations are out of phase of n/2, as visible in the top left panel of Fig. 4.9 on page 94. 
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Figure 4.1: Left panel: lines of constant shift parameter (4.28) in the (Q m ,Qj\) plane (in 
blue) correspond to models in which the acoustic peaks are in the same position; those lines 
are not parallel to lines of constant curvature (in red, the line of O tot = 1 is the locus of flat 
models). Middle panel: a closed (blue, long-dashed), a flat (solid, red) and an open model 
(dotted green) with parameters corresponding to the three colored dots in the left panel on 
the TZ shm = 1.14 line are almost completely degenerate. Right panel: three flat models with 
different shift parameters (and values corresponding to the three colored squares in the left 
panel) exhibit a very different peak structure. In particular, measuring the position of the 
first peak alone is not enough to determine the curvature of the Universe. 



4.1.2.2 Baryon signature 

Let us now examine in more detail the role of baryons in the adiabatic scenario. The relevant 
quantity for the final temperature fluctuations is, from Eqs. (3.128) and (3.129, page 68) with 
$ = f 

\o gi + 2$ - ifikVy =-(l + R)$cos(c s kr}) - (2 + 

VTTR (432) 

+ 2$ - ifi — — <S>sin(c s kr]) , 

v3 

where we have inserted the adiabatic solution (3.113-3.114, page 65) and explicitly restored 
the Doppler contribution. The effect of baryons, R > 0, is twofold: the amplitude of the 
cosine oscillation is larger and the zero point is now displaced to — i?<&, i.e. the gravitational 
effects of falling into and climbing out of the potential at decoupling no longer exactly cancel 
as in Eq. (4.2), where we had taken R = 0. Therefore a larger baryon content enhances 
compression peaks, which correspond to negative extrema of the cosine 1 , while it suppresses 
expansion peaks. This leads to a distinctive signature of the baryon density on the CMB 
spectrum: a larger baryon content boosts odd peaks and reduces the even ones, hence a 
precise measurement of the first three peaks leads to an accurate measurement of the baryon 
content, as is evident from Fig. 4.6 on page 91. 

Up to now we have put aside the Doppler term V-y oc sm(c s krj): the sine is out of phase 
of 7r/2 with respect to the density oscillation, and its maxima fill in the zeros of the cosine. 
In the absence of baryons, this would lead to an exact cancellation and to the disappearance 

1 Note that $ < inside potential wells, thus cos(c s fc?7) < indeed gives D g .j > 0, according to Eq. (3.113, 
page 65), i.e. it corresponds to an overdensity with ST/T > 0. 
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Analytic Contributions 
n t ^0.06 h=0.5 




Figure 4.2: Contributions to the adiabatic temperature spectrum (solid) from the tempera- 
ture monopole (long-dashed), the temperature dipole (Doppler term, short dashed with label 
Gi), and ISW effect (reprinted from Hu & Sugiyama, 1995a). 

of the acoustic peaks: adding the density and velocity term incoherently in quadrature for 
R = gives a constant. However, R > suppresses the Doppler term by a factor (1 + R) 
(in quadrature) with respect to the density term, and the net effect is that the velocity 
contribution partially fills in the minima of the density oscillation without erasing the peak 
structure, as shown in Fig. 4.2. Also the peak structure for the velocity contribution gets 
more washed out by the free streaming conversion than for the density, a consequence of the 
fact that the velocity term is multiplied by /i (Hu & Sugiyama, 1995a). 

4.1.2.3 Early ISW effect 

At recombination, the Universe is not completely matter dominated, since adec ~ 3a eq and 
thus the Bardeen potentials are not exactly constant. This gives an early ISW contribution 
to the anisotropy, which is spread out over a large multipole range, adding in particular to 
the rise from the large scale plateau to the first acoustic peak for the adiabatic scenario, 
cf. Fig. 4.2. Since most of the contribution comes from early times, when rj <C r/o, we can 
approximatively set j£(k(r]o — r])) je(krjo) and write for the ISW contribution to (3.149, 
page 71) 



Jrj dec '/dec 

The early ISW is more prominent if the epoch of equality is delayed due to a smaller 
matter content or to a larger radiation content, for instance in the presence of extra relativistic 
particles, as shown in § 6.1. 

4.1.3 Damping tail 
4.1.3.1 Recombination 

Temperature fluctuations on small angular scales are exponentially suppressed by diffusion 
damping due to the breakdown of tight coupling at recombination, as discussed in § 3.5. The 




(4.33) 
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effect can be roughly incorporated into the undamped solution (3.127, page 68) by multiplying 
it with the damping factor 

V{k) = j dr]g(v)e~ [k/kD{ri)]2 » e-I^D^dec)] 2 j (4.34) 

using the damping length scale k^ 1 of Eq. (3.120, page 66). 

The main parameter dependence of the damping scale is easy to understand physically: 
the matter content sets the horizon scale at decoupling, while the baryon density controls the 
Compton scattering time ~ f~ x . Before recombination, photons diffuse by a random walk 
over a typical length Ad = VN /t, where N is the number of collisions, N ~ rjf. Hence the 
damping length scales as 

Ad ~ Vvdec/i- oc uj m 1/4 u^ 1/2 , (4.35) 
where the last proportion takes advantage of the fact that n e oc uJb (see Eq. (6.17, page 

— 1/2 

137)) and rjdec w m if decoupling happens in the matter dominated era. A more detailed 
estimate is given in Eq. (6.19, page 138), which also includes the effect of the helium fraction, 
which we have ignored here. 

Clearly, when recombination occurs the mean free path goes to infinity very rapidly, and 
therefore the above argument no longer applies, and one has to use a more sophisticated 
analysis. More details and precise fitting formulas for (4.34) can be found in Hu & White 
(1997), while useful fitting formulas for many relevant recombination quantities are detailed 
in Hu fe Sugiyama, 1996, Appendix E. 



4.1.3.2 Reionization 

When the Universe is reionized, the free electron fraction becomes unity again and CMB 
photons can be rescattered. Fairly little is known about the details of the reionization mech- 
anism and its redshift dependence (for a review see Haiman, 2004) but the null detection of 
Gunn-Peterson troughs indicates that the Universe was completely ionized after redshift 6 
(Becker et al., 2001), possibly for the second time (Cen, 2003). The recent WMAP results 
(Spergel et al., 2003) seem to indicate that reionization happened quite early, at a redshift 
z re ps 17, corresponding to an optical depth of r re ~ 0.16 for a standard ACDM model. 

Reionization has two effects on the power spectrum: temperature anisotropies on scales 
below the angle subtended by the horizon at recombination get washed out, and on the same 
scale there is a generation of polarized power. Let us take for simplicity a model in which 
all the hydrogen is suddenly reionized at a redshift z Te , and ignore helium reionization which 
happens around z ~ 3 which only contributes a few percent. Then the corresponding optical 
depth to reionization, r re , is given by 



^re 

to 

cut f Zre n e (z) dz 



(4.36) 



~ H J (1 + z) [^ r (i + z )4 + 0m (! + z ys + q k ^ + z )2 + n A ]i/a • 
The free electron density (per cm 3 ) can be expressed as (see Eq. (6.17, page 137)) 

n e (z) = 11.3 • 10~ 6 (1 - Y p )u b (l + zf , (4.37) 
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where we have included the Helium mass fraction Y p for future reference (see § 6.2.2). For 
a flat Universe (fix. = 0) and neglecting the contribution of radiation, which is a good 
approximation if z Te <C 100, the integral in (4.36) can be performed analytically, giving (Hu 
& White, 1997) 



T re = 4.6-nn 2 (i-y p )-^ ^/n A + n m (i + z re y - 1 . (4.38) 

a L m L 

From the definition of the visibility function g, the probability that a photon last scattered 
between today and redshift z is 

P{z) = [ g(z)dz = 1 - e~ Tiz) , (4.39) 



o 

and therefore the fraction of photons which arrive to us directly from the recombination epoch 
is 1 — P(z xe ) = exp(— r re ). Above the horizon scale at reionization, all photons contribute 
to the anisotropy, while below that scale only the fraction exp(— r re ) which did not rescatter 
contribute. Thus power on small scales will be suppressed by a factor exp(— 2r re ) and the 
reionization damping factor is given by 

f 1 for fcr re <C 1 

Pre(fc) = -2. , , " " (4-40) 

[ e lT ™ for kr m > 1 

The angular scale subtended by the horizon at reionization can be found using (1.32), 
yielding the approximate scaling (Tegmark & Silk, 1995) 



^oca/^. (4.41) 

Without polarization information, reionization is highly degenerate with the spectral tilt 
and a tensor or isocurvature contribution which would add power only on large scales: a 
larger reionization optical depth can easily be accommodated by adding tensors or an isocur- 
vature component an reducing at the same time the overall normalization, thereby exactly 
compensating the reionization power suppression. This degeneracy can be expressed by in- 
troducing a suitable combination of r re and the overall normalization, see Eq. (4.48) and 
compare Fig. 4.7. However, the characteristic signature of reionization is the generation of 
polarized power on the horizon scale of reionization, and the corresponding "polarization 
bump", clearly visible in the bottom right panel of Fig. 6.16 on page 158, around i 20 in 
the E-polarization spectrum can be used to break the degeneracies with other parameters. 

The position and scaling of this bump can easily be understood physically (Zaldarriaga, 
1997): the temperature quadrupole at reionization, which determines the reionization induced 
polarization, is given by the free stream of the temperature monopole at decoupling: 

©2 fare) = (©o + 2$)(r ?dec )j 2 (k(r] ie - r] dec )) . (4.42) 

Given that the /c-oscillation of the monopole is much slower than the one of the Bessel 
function, r s <C rj ie — ?7dec) the first peak corresponds approximately to the maximum of the 
Bessel function, which occurs for k ~ 2/(?y re — 77dec). This translates into t w k(rj - r] Te ) ps 
2(77o — r ]re)/( r ]re — ? ?dec) ~ 2-y/i^T. This peculiar scaling of the position of the reionization 
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bump in the E-spectrum could potentially be used to distinguish the effect of a possible time 
variation of the fine-structure constant, see § 6.3.4. 

Only one parameter is sufficient to characterize the simple model of sudden reionization 
presented above, namely the reionization redshift z Te or equivalently r re ; but it has been 
shown that there are up to five principal reionization modes which could be extracted from 
CMB measurements (Hu & Holder, 2003). Furthermore, it is possible to link the reionization 
history to specific stellar models and try to constrain the parameters of star formation and 
evolution modelling using CMB data (Bruscoli et al., 2002; Holder et al., 2003; Kaplinghat 
et al, 2003a). 



4.2 Normal parameters 

The physical understanding of the characteristic signature of the cosmological parameters 
can be exploited to build a set of analytical functions which describe quantities directly 
probed by the CMB. We call such a set a "normal parameter basis", because the effect of the 
new parameters is almost orthogonal, in the sense that correlations among the parameters 
should be very small. The normal parameter set has the advantage of taking into account 
the most severe CMB degeneracies, such as the geometrical degeneracy described above, 
a feature which improves the efficiency of parameter space exploration (see § 5.1.7). The 
dependence of the CMB spectrum on the normal parameters is almost linear over a wide 
range of values, a very important property which makes them ideal as a basis set for the 
Fisher matrix analysis, see the explanations in § 5.2 and § 6.2.5 for an application. In terms 
of the normal parameters, it is easy to disentangle and understand the physical effects on the 
CMB power spectra of each parameter while keeping the other constant, to the contrary of 
what happens for cosmological parameters. 

We have seen in § 4.1.2 that the shift parameter lZ shlit , the baryon and matter density 
determine the location and relative height of the acoustic peaks. We now expand those 
considerations by introducing a normal parameter set, based on the discussion of Kosowsky 
et al. (2002), to which the reader is referred for further details. See also Sandvik et al. (2004) 
for an application to parameter estimation techniques and Jimenez et al. (2004) for recent 
improvements including the polarization spectrum. 

• The position of the peaks is set by the ratio between the angular diameter distance 
relation (1.32, page 15) and the physical size of the acoustic horizon at decoupling, 
Eq. (4.24, page 82). Hence a first normal parameter which determines the overall 
angular scale is 

A ^ ° A ^\ , (4.43) 

cf. Eq. (4.17), which is just a general expression for the shift parameter. The scale factor 
at decoupling a^ec, or equivalently the redshift of decoupling, depends upon Qbh 2 and 
the Q m /fi r , for which Hu & Sugiyama (1996) provide an accurate analytical fitting 
formula. The effect of a change in A while keeping the other normal parameters fixed 
is displayed in Fig. 4.3. 

• The radiation/matter ratio sets the epoch of equality, which in turn determines the 
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Figure 4.3: Impact of the shift parameter (4.43) on the CMB temperature (left) and polar- 
ization (right) spectra, all other normal parameters kept fixed. The geometrical projection 
effect affects temperature and polarization in the same way. In the bottom panel, we plot 
the percent difference with respect to the reference model (black). 



amount of early ISW, thus we introduce the parameter 

O-dec 

which gives the matter to radiation density ratio at the time of decoupling 
of the first acoustic peak due to the early ISW is visible in Fig. 4.4. 

• The geometrical degeneracy is along the energy density in the cosmological constant, 
which also gives the amount of late ISW effect. Thus we use the parameter 

V = Q A h 2 . (4.45) 

As shown in Fig. 4.5, the impact is quite small in magnitude and solely on large angular 
scales, where cosmic variance limits our ability to constrain this parameter, making of 
the cosmological constant one of the worst determinable parameters with CMB data 
alone. 

• The parameter A already includes the effect of the baryon density on the spacing and 
location of the peaks, which is produced by the dependence of the sound horizon on 
the baryon content. Therefore keeping the other normal parameters and in particular 
A fixed while varying 

B = n b h 2 (4.46) 

isolates the baryon driving effect on the acoustic oscillations, which sets the relative 
height of the peaks. Since the polarization amplitude is proportional to the temperature 
dipole at recombination, which in turn is suppressed by a factor (f + R) 1 ^ 2 with R oc 
Oft/i 2 , a larger baryon density reduces the height of polarization peaks (Fig. 4.6). 



(4.44) 
. The boost 
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Figure 4.4: Impact of a change in the radiation to matter energy density ratio at decoupling 
(4.44) on the temperature (left) and polarization (right) spectra, all other normal parameters 
kept fixed. This can more easily be interpreted as a shift in the epoch of matter-radiation 
equality, which changes the amount of early ISW effect contribution around the first acoustic 
peak. 
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Figure 4.5: Impact of the energy density in the cosmological constant (4.45) on the CMB 
temperature (left) and polarization (right) spectra, all other normal parameters kept fixed. 
The impact is only on large angular scales due to the late ISW effect, where measurements 
are limited by cosmic variance and therefore cannot constraint much this parameter. 

• The CMB spectrum turns out to be almost linear in the combination 

Msn "' h \ 1 + 4j£) =nj '{ l + n) • {iA1) 
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Figure 4.6: Impact of the baryon density (4.46) on the CMB temperature (left) and polariza- 
tion (right) spectra, all other normal parameters kept fixed. A larger baryon content boosts 
odd peaks and suppresses even ones, see § 4.1.2.2. The height of the polarization peaks is 
reduced by a larger baryon content. 



which is a refinement of our previous approach of taking simply £l m h 2 as a determining 
parameter, see Kosowsky et al. (2002) for more details. 

• A good way of taking into account the degeneracy between the optical depth to reion- 
ization and the scalar normalization described in § 4.1.3.2 is to adopt the parameter 

T = A s exp(-2r re ) , (4.48) 

where for the adiabatic model considered here A s = Q is the scalar amplitude of the 
power spectrum of the gauge invariant curvature perturbation, cf. Eq. (4.6, page 79). 
When adopting a change in r re , the normalization A s is also changed as to keep the 
power above the third peak unchanged, thus avoiding artificial degeneracies with the 
other normal parameters, which would disappear if one adopted a different normaliza- 
tion convention (Kosowsky et al., 2002), see Fig. 4.7. 

• The scale dependence of the initial power spectrum is described by the scalar spectral 
index n s , as in (4.6). A value n s > 1 ("blue index") increases the power for wavevectors 
larger than the pivot scale, and thus yields more power for large multipoles; the converse 
is true for n s < 1 ("red index"), see Fig. 4.8. Therefore the impact on the CMB spectrum 
can be approximately modelled as 

Ci T ,E{n s ) « C iT , E (n s = 1) (- (4.49) 

with 4) a pivot point which should be chosen as to match kp (even though a different 
choice will only correspond to a change in overall normalization). 
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Figure 4.7: Impact of the parameter T denned in (4.48) on the CMB temperature (left) and 
polarization (right) spectra, all other normal parameters kept fixed. Increasing r re and the 
overall normalization at the same time as to keep the power above the third peak unchanged 
reveals the degeneracy between normalization and reionization. The only measurable effect 
is at large scales, where the temperature signal is enhanced for smaller T (and hence larger 
r re ) as well as the reionization bump in the polarization spectrum. 
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Figure 4.8: Impact of the scalar spectral index on the CMB temperature (left) and polariza- 
tion (right) spectra, all other normal parameters kept fixed. A blue spectrum (n s > 1) gives 
more power at larger multipoles. The glitches are numerical artifacts. 

Given the above correspondences, we can transform from the cosmological parameter 
set (£l m ,Cl/i,Q,b,Q, r ,h) into the normal basis (A,1Z,V,B,M.) and vice-versa by numerically 
inverting the relations (4.43-4.47). 
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4.3 General initial conditions 

As we have seen in § 3.2 and § 3.3, a Universe containing photons, massless neutrinos, cold 
dark matter and photons coupled to baryons admits four growing modes for the perturbations. 
To this set, one should add a baryon isocurvature entropy mode, which we have not described, 
but which behaves exactly as the cold dark matter mode, only rescaled by an overall constant 
^fo/^cdm (Gordon & Lewis, 2003). Thus without loss of generality, we can treat the CDM 
and baryon isocurvature modes as one single mode, and restrict our considerations to the 
four modes: adiabatic, CDM isocurvature, neutrino entropy and neutrino velocity. 

4.3.1 Angular power spectra for all modes 

The numerical integration of the evolution equations is necessary to go beyond the early 
time approximative solutions derived earlier and obtain the full angular power spectra for 
the different types of initial conditions. Recent versions of camb include the possibility 
of specifying neutrino entropy and velocity initial conditions, along with the adiabatic and 
isocurvature CDM ones. The resulting temperature and E-polarization spectra are displayed 
in Figures 4.9 and 4.10. Analogously to the adiabatic-CDM isocurvature case discussed in 
§ 4.1.1, in the most general case the modes are arbitrarily correlated with each other, and 
each of them possesses its own spectral index. In the figures we plot the correlators for total 
positive correlation between the modes, take scale invariant spectral indexes for all modes, 
n = 1 and we fix the other cosmological parameters to a flat, concordance ACDM model with 
early reionization, as emerged from the WMAP data for the pure adiabatic case. 

The collection of modes presents a wide variety of oscillatory structures, and very different 
amplitude ratios between the large-scale plateau and the peaks. Since the perturbation 
equations are linear, the most general CMB power spectrum is a positive definit superposition 
of all the modes. From a phenomenological point of view, we expect that widening the initial 
condition space to include all of the four possible modes, will lead to large degeneracies 
between initial conditions and cosmological parameters. We dedicate § 7.2 to a thorough 
investigation of this issue. On the other hand, if the neutrino isocurvature modes were non- 
zero, their contribution could conceivably allow to fit the CMB data without the need for a 
cosmological constant, a possibility which we analyze and reject in § 7.3. 

4.3.2 Modes superposition 

In the purely adiabatic scenario, initial conditions for scalar perturbations are described by 
two parameters, namely the overall normalization and the spectral index of the curvature 
perturbation power spectrum, as in Eq. (4.6, page 79). By enlargening the initial conditions 
space to include all of the four possible modes, we add nine amplitudes (three for the CDM 
isocurvature, neutrino density and velocity modes, and six for the correlators between the 
four modes) and three spectral indexes, for a total of 14 parameters describing the most 
general initial conditions. 

Although the dependence of the modes on the amplitudes is trivial, the numerical search in 
the initial conditions parameter space is complicated by the positive definiteness conditions 
on the total spectrum. The total temperature (or polarization) angular power spectrum 
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Figure 4.9: Temperature and E-polarization angular power spectra for the four modes consti- 
tuting the most general initial conditions for CMB anisotropies, Figure 1 of 2. The correlators 
are for positive total correlation between the modes, and we take all spectral indexes to be 
unity. The remaining cosmological parameters are fixed to a concordance, flat ACDM model. 
In the lower panel, the correlators are plotted in absolute value. The four modes are: ad 
(adiabatic), ci (CDM isocurvature), nd (neutrino density /entropy), nv (neutrino velocity). 



obtained by superposing the modes must be positive 

4 

C e =^2 Mij&l > V £, (4.50) 

with the modes correlation matrix M £ V n , where V n denotes the space of n x n real, 
positive semi-definite, symmetric matrices with in our case n = 4, and the C 1 / are computed 
for a fixed choice of cosmological parameters when only the corresponding element of the 
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Figure 4.10: Temperature and E-polarization angular power spectra for the four modes con- 
stituting the most general initial conditions for CMB anisotropies, Figure 2 of 2. 

correlation matrix is non-zero, i.e. for Mij = 1, all others vanishing. The elements of the 
correlation matrix are arranged so that the amplitudes of the pure modes are along the 
diagonal (so that Ma > for i = 1, . . . , 4) while the off-diagonal elements are the correlators 
amplitudes. Each correlator amplitude must satisfy Schwartz' inequality 

Mfj < MiM,j i,j = 1,...,4 (4.51) 

because of the positive definiteness condition (see Trotta, 2001, Appendix A for a proof), but 
in general the correlators amplitudes can of course be negative. Finally, Schwartz' inequality 
between all pairs i / j of M is a necessary but not sufficient condition for the positive 
definiteness of the correlation matrix. A sufficient condition is that all sub-determinants of 
M are positive or zero (see e.g. Heuser, 1993, proposition 172.5), giving the four sufficient 



conditions on the elements of M: 

Mn > , (4.52a) 

M n M 22 - Mi 2 2 > , (4.52b) 

M n M 22 M 33 + 2M 12 M 23 Mf 3 M 22 - M? 3 M 33 - M? 2 M 33 - M$ 3 M U > , (4.52c) 

detM>0. (4.52d) 



When numerically searching the initial conditions parameter space, the conditions (4.52) 
must be imposed by hand to avoid regions which would lead to non-physical (i.e. negative) 
angular power spectra. This approach is used in Trotta et al. (2001) and some related issues 
are discussed in § 7.2. 

A more convenient parametrization of the correlation matrix is employed in Trotta et al. 
(2003), where the matrix M £ V n is written as 

M = UDU T , (4.53) 
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U G SO n , D = diag(di, ^2, • • • , d n ) and di > 0, i G {1, 2, . . . , n}. Here 50 n is the space of 



n x n real, orthogonal matrices with det = 1 and n 
linear combination of generators Hj of SO n : 



U = exp 




4. We can write U as an exponentiated 



(4.54) 



with 



Hi 



/ o i o ... N 

-10 ... 
... 



(4.55) 



V : ::•■/ 

and so on, with — 7r/2 < ol{ < tt/2, i G {1, 2, . . . , (n 2 — n)/2}. In analogy to the Euler angles 
in three dimensions, we can re-parameterize U in the form 



(n 2 -n)/2 

U = exp (tpiHi 

i=i 



(4.56) 



n)/2}, whose functional 



with some other coefficients — tt/2 < tpi < tt/2, i G {1, 2, . . . , (ji 
relation with the cVs does not matter. The diagonal matrix D can be written as 

D = diag (tan(#i), . . . , tan(0 n )) , 



(4.57) 



with < 6i < tt/2, for i G {1,2, ...,n}. In this way, the space of initial conditions for 
n modes is efficiently parameterized by the (n 2 + n)/2 angles 0i,ipj. In our case, n = 4 
and the initial conditions are described by the ten dimensional hypercube in the variables 
(01, . . . , 9$, ipi, . . . , ipe). This is of particular importance for the numerical search in the 
parameter space. One can then go back to the explicit form of M using Eqs. (4.56), (4.57) 
and (4.53). This more efficient parametrization is employed in § 7.3. 

There is no optimal solution for an efficient and physically motivated parametrization 
of the initial amplitudes; another possibility, based on a ten-dimensional hypersphere, is 
employed in the analysis of Bucher et al. (2004). 



The fundamental problem of scientific 
progress, and a fundamental one of ev- 
eryday life, is that of learning from expe- 
rience. Knowledge obtained in this way 
is partly merely description of what we 
have already observed, but part consists 
of making inferences from past experience 
to predict future experience. 

Harold Jeffreys 
Theory of probability 
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We are now in a position to attack the task of actually determining the values of cosmological 
parameters from the observed CMB anisotropy. To this end, we need several statistical tools, 
which we introduce in § 5.1.1. The emphasis is on their application to the CMB: we work 
out the cosmic variance limit from first principles in § 5.1.2, and we present the Maximum 
Likelihood principle and its application to data analysis in § 5.1.3; we focus on the differences 
between the frequentist (§ 5.1.4) and Bayesian approach (§ 5.1.5) to statistics, explaining the 
procedures to assess likelihood and confidence intervals and their interpretation; we then 
discuss the implementation of two popular methods to sample the parameters space, the 
traditional gridding method (§ 5.1.6) and the more efficient Monte Carlo sampling (§ 5.1.7). 
In § 5.2 we explain the details of the Fisher matrix analysis, an handy and accurate technique 
to produce forecasts for the expected capabilities in terms of parameters extraction of future 
CMB observations. In the last section, § 5.3, we offer a brief historical review of the last 
decade of CMB observation, presenting the data-sets which are then exploited in Chapters 6 
and 7. 

5.1 Elements of probability and statistics 
5.1.1 Some concepts of probability theory 

We work in real, three-dimensional space, and we consider a field X which is defined in all 
points r £ R 3 in such a way that the probability of obtaining the value X at the point r 
is V(X,r). We call X an infinite dimensional random field and V its 1-point probability 
distribution function (pdf). In order to fully describe the random field X : we need to specify 
not only V, but also the 2-point pdf, denoted by V2{X\,T\,X2,T2), which describes the 
probability of getting the value X\ at the point y\ and the value X^ at the point r2; then the 
probability distribution for all triples of points, V3, and so on for an arbitrarily large number 
of points. 

From the definition of probability, the n-point pdf 's are not all independent, obeying the 
relations 



The field X is said to be statistically homogeneous if its 1-point pdf is the same in all points 




(5.1) 
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of space: 

V(X,r) =V(X) (statistical homogeneity), (5.2) 

and statistically isotropic if the 2-point pdf depends only on the distance between the points 
but not on the direction of the vector joining them: 

7 3 2(^l ) ri,X2,r2) =V2(Xi,X2,r) (statistical isotropy), (5.3) 

with r = |ri — r2|. In cosmology, all random fields are assumed to be homogeneous and 
isotropic. From now on we will always make this assumption. We denote with (•) the 
ensemble average over realizations of the field X (expectation value). For a function f(X), 
its expectation value is 

(f(X))= [ f(X)V(X)dX, (5.4) 

where the integration goes over all possible realizations of X defining the sample space Q. 
The expectation value of f(X) = X is called the mean of X. Under the assumption of 
isotropy, (X) is a constant independent on r. Therefore in cosmological perturbation theory 
we can always take the perturbations to have zero mean, since a constant offset can always 
be reabsorbed in a redefinition of the corresponding background quantity. 

Consider X(k), the harmonic transform of X with respect to the eigenfunctions of the 
Laplace operator; in R 3 this is the usual Fourier transform. Then as a consequence of 
homogeneity and isotropy, -X"(k) has the following properties: 

(X(k))=5^(k)(X) (5.5) 
(X(k).X(k'))=5^(k-k')g(k) (5.6) 

The real space correlation function is defined as 

^E^rO-^+r)). (5.7) 

It is the expectation value of X\ = X(ri) and X 2 = X{v\ + r) under the 2-point pdf, 

£(r) = f dXi f dX 2 V 2 {X 1 ,X 2 ,r)X 1 X 2 , (5.8) 

where in writing £(r) instead of £(r) we have assumed statistical isotropy. 

The field X is called space ergodic if we can perform a spatial average instead of an 
ensemble average and obtain the same result: 

lim (^ttrA [ f[X(r)]d i r = (f[X]). (5.9) 

R^oo \6 J J\r\<R 

Notice that ergodicity requires that the field is defined over an infinite space, such as R 3 . 
The temperature field of the CMB however lives on the two-sphere S 2 , which is a compact 
manifolds and therefore not ergodic. Therefore even if we could measure the anisotropies 
with no experimental error, we still would not be able to perform the ensemble average with 
perfect accuracy, see § 5.1.2. 

We denote by / the estimator for f(X), i.e. a procedure applied to a random sample 
of X to produce a numerical value for /, which is called the estimate. When applied to a 
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set of observations X° bs , X^ , . . . X^ s which constitute a random sample, the estimator / 
produces a distribution of estimates, and as such it too is a random variable. 

An important particular case is the Gaussian random field, for which all the n-point pdf 's 
are Gaussian. The 1-point pdf is then 

p(x) = vk^{S)' (5 - 10) 

while the 2-point pdf is given in terms of the field's correlation function £ as 

1 ( X? + X? - 2e(r)X 1 X 2 \ 
V 2 (X 1 ,X 2 ,r) = ———===exp 1 n J u l\> (5-11) 



and the 2-point pdf (or equivalently, the correlation function) contains the full statistical 
information. 

The statement that the correlation function determines the 2-point pdf completely is true 
only for a Gaussian field; in general, from (5.8) it is clear that after the integration £(r) 
only contains part of the information encoded in V 2 - For instance, Jones (1997) gives an 
interesting counter-example of a Gaussian and a non-Gaussian distribution with the same 
correlation function and yet with two different 2-point pdf's. 

5.1.2 The origin of cosmic variance 

It is instructive to compute explicitly the variance of the observed Cg starting from basic 
principles. If we assume that the temperature fluctuation 6 is an isotropic and homogeneous 
random field, then the coefficients of the harmonic expansion on the 2-sphere, the a^ m 's, have 
zero mean and variance given by the true Q's: 

{aim} = (5.12) 
(a} m ■ a,£i m i) = SwS mm 'Ci . (5.13) 

Inflation predicts that the aim's are very close to Gaussian variables, so we make the assump- 
tion of Gaussianity and for the pdf of a£ m we take 

V(a £m ) = _ e 2C i . (5.14) 



The true a£ m 's are of course inaccessible to us, but from the measured temperature fluctuation 
we obtain an estimate which we denote by a^ m . As an estimator for the power spectrum we 
define 

i 

ft-^TT E I4„I = ^V, (5.15) 

m=— I 

where we have introduced the variable 

\ai 



■m = -P 



Eq. (5.15) implies an ergodic hypothesis, since in the estimator we replaced the expectation 
value in (5.13) by an average over independent azimutal directions by summing over m. 
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The variable V is a sum of 2£ + 1 squared Gaussian variables with unit variance, and 
therefore (Kendall & Stuart, 1977) its pdf is the chi-square pdf with 21 + 1 = I degrees of 
freedom (dof): 



1-2 

v— 



From this we can write down the pdf for the estimator Ce, which is 




nCe) = ^V x? [ ^ ) (5.18) 



which shows that our estimator is distributed according to a chi-square pdf. For / — > oo the 
Central Limit Theorem guarantees that the distribution will become Gaussian, hence 

lim C t = C e (5.19) 

l— >oo 

and the estimator is said to be consistent. From (5.18) we can calculate the expectation value 
of Ci, finding 

(Ce) = Ci (unbiasedness) , (5.20) 

and its variance 

(Q 2 ) - ( c e) 2 = ■ 2 t+i c i ( efficienc y)- ( 5 - 21 ) 

We conclude that the fact that there are only It + 1 independent directions on the sky for a 
given multipole I limits the efficiency of our estimator for the power spectrum with variance 

(Cj) - (C e ) 2 2 

— = — (cosmic variance). (5.22) 

Ce 21 + 1 

Despite the fact that cosmic variance is a fundamental statistical limit, an ingenious 
method to circumvent it and to measure the temperature quadrupole with better than cosmic 
variance precision has recently been proposed by Skordis & Silk (2004). 



5.1.3 The principle of Maximum Likelihood 

The estimation problem can be generally stated as follows: starting from a limited number of 
observations, which constitute a random sample, one wants to reconstruct some properties of 
the underlying pdf. It is simpler to think of the properties of the pdf as unknown parameters, 
which we seek to determine. Consider a set of n observations d = { d° bs , d^ , . . . ,o^ bs } of 
the variable X and a set of p parameters 6 = {#° bs , #2 bs , . . . , @p hs }- The measurements have 
a conditional probability V(di\0) to be observed given the value 6 for the parameters. The 
problem at hand is to estimate the joint conditional probability 

n 

L(d\0) = Y[r(di\0) (5.23) 

i=i 

from the observations d. In the above definition, we thought of L as a function of the random 
variable X; however, once the observations have been done, we can think of L rather as a 
function of the unknown parameters for a given value of d and call it the likelihood function 
(LF). 
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The maximum likelihood (ML) principle affirms that as an estimate for 6 we should choose 
the value 6* which makes the probability of the actual result obtained, d, as large as it can 
be, i.e. 

L(d|0*) > L(d\9) (Maximum Likelihood) (5.24) 

for all possible values of 0. 

Instead of maximizing the LF, one can minimize the quantity 

£ = -21nL, (5.25) 

which we will call lognormal LF. 

If the pdf is Gaussian, then the ML estimation reduces to the usual least square fit: 
suppose that the measured d° bs are independent from each other and Gaussian distributed 
around their (unknown) true values di{6), with variance given by the experimental error <7° bs . 
Then minimizing C is equivalent to minimization of the quantity 



n 



d° hs -di{0) 
of 



which is called the chi-square. 

Applied to the problem of parameter extraction from CMB data, the ML prescription 
means that, given the measured power spectrum, C^ bs , with errors ae, we have to minimize 
the value of the chi-square by varying the cosmological parameters of interest. This procedure 
only gives information about the set of parameters which are the "most probable" to have 
generated the measurements at hand. However, quantifying the error on our estimate for 
the parameters is a more subtle business, since it involves dwelling into the exact definition 
of what probability means. There is a long dispute going on among specialists about the 
correct interpretation of probability, and some fundamental issues are still unresolved. One 
can take fundamentally two different point of views on the subject, the orthodox (frequentist) 
approach or the Bayesian point of view, as we now explain. A good introduction to Bayesian 
methods and a comparison with the sampling theory approach can be found in Box & Tiao 
(1973), while Kendall & Stuart (1977) give full details about frequentist theory calculations. 
Jaynes (2003) is a very enjoyable book, which provides a wider perspective on the logic 
of science and probability theory. A useful textbook with many stimulating examples of 
Bayesian inference is MacKay (2003). Frodesen et al. (1979) - written by experimentalists 
who have used on the field the methods described - is more praxis-oriented, and explains in 
a practical way the statistical mambo-jumbo. 

5.1.4 Orthodox probabilities — Confidence intervals 

The orthodox definition of probability - also known as "sampling theory" approach - is based 
on the empirical repeatability of the experiment, see e.g. Jaynes (2003). If an experiment is 
performed N times and the outcome A occurs in M of this cases, then the probability of the 
outcome A is 

P(A)=lim— . (5.27) 

N— >oo iv 

In the case of continuous variables, the concept of probability is defined as the limiting 
process (5.27) reached from a finite subdivision in N equiprobable intervals of the sample 
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space (Kendall & Stuart, 1977, Section 7.11, Vol. 1). The frequentist approach allows the 
definition and interpretation of exclusion regions or confidence intervals for the parameters, 
see below. It is the point of view usually adopted in particle physics, where an experiment can 
be repeated many times under the same circumstances. It is not very popular in cosmology 
though, where there is only one particular realization to observe. 

Confidence intervals - frequentist 

Confidence intervals in the frequentist approach have a straightforward interpretation: con- 
sider a random variable X whose pdf depends on the parameter 6 which we wish to estimate 
from a random sample {x° bs , . • • , x ^} with an estimator 6. For instance, one can 
think of 9 as the true mean ji of a normal distribution, and the estimator as the sample 
mean, fi = N~ x J2i x t° s - 

The estimates are distributed according to some pdf, which we denote by V e . Then a 
1007% confidence interval for the estimated parameter is the range [0i;#2] such that the 
probability content for the estimator is 7, i.e. 

[■02 

p{e 1 <e<e 2 )= v e d6 = j. (5.28) 

Notice that this is a statement about the probability of our estimate 9 to lie in a certain 
range, with the interpretation that, if we would draw the iV samples L times under identical 
circumstances, then the estimates produced by 6 fall in the range [6\; 62] "fL times. Therefore 
at this stage we are merely making a statement of the distribution of our estimator. If we 
want to convert this into a confidence statement for the true value 6, we can say that there is 
a probability 7 that the random interval [6>i;#2] will cover the true value 6. In other words, 
in the long run the limits 9\ and 62 are such that the statement 

01 < 9 < 9 2 (5.29) 

will be true in 1007% °f the cases. 

Unfortunately, the above interpretation is unapplicable to cosmology, where we cannot 
draw new samples at will from the underlying distribution, but we have to content ourselves 
with the only realization we happen to observe. However, we can still use as an estimator 
the least-square fit to the observed value, and interpret the result in frequentist's terms. 

Consider the least-square fit of (5.26), which applied to the CMB power spectrum is 




(5.30) 



where the observed Cf° s are estimated using the estimator (5.15): since each term is a sum 
of 21+ 1 Gaussian variables squared (the a£ m 's), its distribution becomes Gaussian by virtue 
of the Central Limit Theorem only for large I. The a^ hs are the estimated errors from the 
observations for each multipole, and 6 is the vector containing the p cosmological parameters 
of interest. The functional dependence of Ce(6) is given by the underlying theory, which we 
try to falsify by comparing its predictions with the actual observations. 

The least-square estimate for - which is equivalent to the ML estimator for Gaussian 
variables - is the value 6* for which the \ 2 reaches the minimum value x 2 * > which is called 



5.1 Elements of probability and statistics 



105 



least square estimate. Until this point, the least-square estimation makes no assumptions 
about the underlying pdf for the variables. To the extent to which the Q's can be considered 
as independent Gaussian variables, then the quantity x 2 * is distributed as a chi-square pdf 
with f = n — p dof, denoted by V x 2, see (5.17). Here n is the number of multipoles observed 
and p the number of fitted parameters. 

Under these assumptions, the distribution V^i provides a measure of the goodness of fit: 
assume that a given parameter set 0q is the correct one, and that the measured chi-square in 
our Universe for 6q is Xo', then if the measurement would be repeated many times in different 
realizations, the probability that the outcome will be equal or larger than the true value Xo 
is 



The interpretation in frequentist terms is straightforward: if some other parameters 9\ have 
X 2 (#i) = Xi ^ Xoi the chance that Q\ is the correct set and we are actually seeing a 
realization far out in the tail of the distribution is very small. 

It now remains to define confidence intervals for the parameters basing on the above fre- 
quentist interpretation: a 1007% confidence interval encompasses parameters whose measured 
X 2 is smaller than the value of corresponding to the quantile 1 of 1 — 7 for the distribution V 2 . 
In other words, if the measurements could be repeated many times, in the long run the above 
confidence interval would include the true value of the parameters 1007% °f the time. Thus 
the parameter space outside the estimated confidence interval is a proper exclusion region 
at the given confidence level. Notice that the frequentist confidence levels depend both on 
the total number of parameters fitted and on the number of independent data points we are 
using. 

We conclude this section with two remarks: firstly, the above assumptions of Gaussianity 
and independency are only partially fulfilled by the C^'s, therefore the outcome of such 
a frequentist analysis is only approximative (see Abroe et al., 2002 for a strictly correct 
frequentist parameter estimation, which involves the numerical sampling of the pdf which we 
simply took as a chi-square); and second, the clean interpretation of the frequentist approach 
is somewhat weakened by the fact that we are compelled to invoke measurements in other 
realizations which cannot take place, not even in principle. Bayesian statistics takes instead 
a more pragmatic approach, by dealing only with actual observations. 

5.1.5 Statistical inference - Likelihood intervals 

Bayesian statistics does not consider possible outcomes of measurements which are never per- 
formed. Instead, it exploits the actual data to update our knowledge about the probability 
of a certain statement, starting from our prior degree of belief. Criticism has been raised 
against this approach because the final inference depends on the prior information available, 
and therefore seems to suffer from a certain degree of subjectivity. However, Bayesian infer- 
ence can be applied to theories which are not repeatable and are unscientific in the frequentist 
point of view (e.g. the probability that it will rain tomorrow). It is based on Bayes' Theorem 2 , 

1 Given the pdf V, x is said to be the quantile of q if it satisfies V{u)Au = q. 
2 Rev. Thomas Bayes, 1763. 




(5.31) 
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which is nothing more than rewriting the definitions of conditional probability: 

V{A\B) = V ^ B ^g^ (Bayes' Theorem). (5.32) 
In order to clarify the meaning of this relation, let us write 6 for A and d for B, obtaining 

L(d\e)v{e) L(d\o)v(d) 

which relates the posterior probability V{6\d) for the parameters given the data d to the 
likelihood function L(d\6) if the prior pdfV(9) for the parameters is known. The quantity 
in the denominator is independent of 6 and it is called the evidence of the data for a certain 
model (MacKay, 2003). It is important for model comparison, but here we shall regard it 
just as a normalization constant. In short 

likelihood x prior 

posterior = . (5.34) 

evidence 

The prior distribution contains all the (subjective) knowledge about the parameters before 
observing the data: our physical understanding of the model, our insight into the experimental 
setup and its performance, in short the amount of all our prior scientific experience. This 
information is then updated via Bayes theorem to the posterior distribution, by multiplying 
the prior with the LF which contains the information coming from the data. The posterior 
probability is the base for inference about 6: the most probable value for the parameters is 
the one for which the posterior probability is largest. 

Bayes' postulate 3 states that in absence of other arguments, the prior probability should 
be assumed to be equal for all values of the parameters over a certain range, m ; n < 6 < max . 
This is called a "flat prior", i.e. 

p 

V{6) = [H(0 - min )tf(0 max - 0)] 11 [# maXii - flmi^r 1 , (5.35) 

i=l 

where H is the Heaviside step function and 6 ma , x ,i > ^min,i V i- This is one of the principal 
conceptual difficulties of Bayesian inference: a flat prior on 6 does not correspond to a flat 
prior on some other set f(0), obtained via a non-linear transformation /. Therefore the result 
of Bayesian inference do depend on the choice of priors, even though this usually does not 
constitue a major obstacle in practical problems - see however Bucher et al. (2004) for an 
instructive example of the role of priors. 

We see from Eq. (5.33) that the Maximum Likelihood principle is equivalent to Bayesian 
inference in the case of flat priors. We will always work with flat, top-hat priors unless 
otherwise stated. There is however an important conceptual difference. By writing the 
posterior distribution as 

W) = ^f' (5-36) 

it follows that Bayes' Theorem imposes to maximise the joint probability V(6, d) of 6, d, while 
Maximum Likelihood requires that the conditional probability L(d\0) should be maximised. 

3 Bayes' postulate is also known - perhaps with an hint of sarcasm - as the Postulate of Equidistribution of 
Ignorance. 
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Likelihood intervals — Bayesian 

Bayesian statistics use the LF to perform an interval estimation for 6. basing on Bayes' 
Theorem, Eq. (5.33), we not only consider the ML point in parameter space as the "most 
likely" value of the unknown parameter; we shall also interpret values further and further 
away as less and less likely to have generated the particular measurement at hand. Hence 
likelihood intervals drawn from the LF measure our "degree of belief" that the particular set 
of observations was generated by a parameter belonging to the estimated interval. This is 
radically different from the frequentist interpretation sketched above. 

Let us simplify the notation by writing L(6) instead of L(d\6), since now we consider 
the LF as a function of the parameters given a data set d. Assume further that the LF is a 
multivariate Gaussian distribution in the p parameters 0, i.e. 

L{6) = (det C)- 1/2 (2vr)" ? '/ 2 exp(-£/2) , (5.37) 
£ = -21nL = (0- n) T C-\Q- y.) (5.38) 

where T denotes transposition, /x is the expectation value of the parameters // = (0) and C 
is the covariance matrix 

Cij = ({9i - Hi){0j - nj)) . (5.39) 

From the likelihood one can then obtain the posterior distribution via (5.33), once the prior 
is specified. For the prior distribution V(9) a simple choice are so-called "flat" priors, a 
multidimensional top-hat function over some range which is supposed to encompass all the 
values of interest. Usually, in grid-based method the prior coincides with the extension of 
the grid, so that the prior is just a multiplicative constant and we can identify the likelihood 
with the posterior. As mentioned, this choice is somewhat arbitrary, since it depends on the 
basis chosen for the parameters. 

We can Taylor expand a general LF around its maximum which is given by our ML 
estimate 9* of /x, which on average coincides with the true mean for a normal distribution, 
{6*) = /x. By definition of the ML point the first derivatives vanish, dC/d9i{6*) = 0, and we 
obtain 




(5.40) 



If the LF is sharply peaked around (?*, i.e. the errors on the parameters are small enough, 
then third order terms are unimportant and the above Gaussian form is a good enough 
approximation everywhere in parameter space. By comparing with (5.38) we find that the 
covariance matrix can thus be estimated as 

C = F- where F„ S (\^\ ^ (5.41) 

is called Fisher information matrix (Kendall & Stuart, 1977, Chap. 15, Vol.1). 

According to our understanding of the LF as a measure of our degree of belief for the 
possible values of the probability that parameters within a certain region from the ML 
point have generated the observations should be proportional to the likelihood content of the 
region. The probability content depends on whether we are estimating all parameters jointly, 
or keeping some of them fixed to their ML value, or rather disregarding a certain subset by 
integrating over them (marginalization) . We consider each case in turn. 
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Estimation of all p parameters jointly. 

Without loss of generality we can take in the following /x = in Eq. (5.38), which can always 
be achieved by shifting the origin of the coordinate system in parameter space. Contours of 
constant likelihood define hyperellipses in parameter space with some probability content we 
wish to determine. To this aim we consider the quadratic form 



and for the LF (5.37) the condition Q(0) = Qf, for some constant gives the contours 
of constant likelihood. We write to indicate that the numerical value of the constant 
depends on the number of parameters under consideration, s, and on the desired probability 
content of the hyperellipse, 7. It can be shown (Kendall & Stuart, 1977, Chap. 8, Vol.1) that 
the quadratic form Q is chi-square distributed with s dof, which allows us to relate with 
the probability content of the ellipse. 

If we want a confidence region containing 1007% °f the joint probability for all p param- 
eters, then s = p and is determined by solving 



The projection (not the intersection) of the hyperellipse Q(0) = onto each of the param- 
eter axis gives the corresponding likelihood interval for each parameter when all parameter 
are estimated simultaneously (which we will call "joint likelihood interval"). 

It is a simple geometrical problem to find an analytical expression for the joint likelihood 
interval for each parameter: for the parameter 1 < d < p, the intersection of the hyperellipse 
with the hyperplane defined by Od = c, with c a constant, gives either an hyperellipse in p— 1 
dimensions, or a point or else an empty set. The extrema of the joint likelihood interval for 
the parameter d are given by the values of c for which the p — 1 dimensional ellipse reduces 
to a point. 

To find the equation of thep— 1 dimensional ellipse we proceed as follows: define C -1 = M 
and write Q(0) = in the form 



q(o) = e T c~ 1 e 



(5.42) 




(5.43) 




(5.44) 



where we have defined 



e 



... , 0d-i, Oa+i, . . . , Op) G R J 



p-i 



(5.45) 




M 



"1^+1,1 



(5.46) 



\ rripi 



Now we diagonalize the submatrix M 



diag(Ai, . . . , A p _i) = A 



ITMU 



(5.47) 
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finding the eigenvalues A*, i < 1 < p — 1 and eigenvectors (u\, . . . , u p -i), and after some alge- 
braic manipulations of (5.44) we arrive at the equation of the p—1 dimensional hyperellipse 

p-l p-l c 2 / \ 2 

y~] Kz"i = Q7 - rn dd c 2 +y~] y I m dj Uji , (5.48) 
i=i i=i * \jyd / 

where we have defined the new variables 

* = (ffU)i + Y.Y1 m <ii u ii ' < i < p - 1 . (5.49) 



The above hyperellipse becomes degenerate if 



p-i 

x i z f = ° ( 5 - 5 °) 

i=i 



from which we obtain a quadratic equation for c with solutions 



--mm, max — r 1/2 " 

■*P-1 \-l 



(5.51) 



It is easy to show that the positive definiteness condition for the Fisher matrix guarantees 
that the quantity under the square root in the denominator is always > 0. In conclusion, the 
joint likelihood interval for the parameter 9 d with likelihood content 7 is given by 

Cmin ®d ^ C max • (5.52) 

Estimation of k < p parameters, the others fixed. 

We are sometimes interested in giving confidence intervals for some subset k < p of the 
parameters, while assuming the other p — k parameters as (exactly) known. Without loss of 
generality we shall take the first k parameters as the one we are interested in, and we split 
the parameter vector as 

/ 1\ 



u 



(5.53) 



with t G R fc and u £ R p k . Correspondingly we write the covariance matrix in (5.38) as the 
Fisher matrix estimate of (5.41), 



1-1 



A G 
G T B 



(5.54) 



where A 6 R fcxfc , B £ W°~ kx P- k and G € RP- fcxfc . 

If the known parameters u are held fixed at their ML value, the LF for the parameters of 
interests t is simply the full LF restricted to the k subspace, 

L(t|u*) oc exp(-^t T At) , (5.55) 
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with an appropriate normalization constant, and the new covariance matrix V G j{^ xfc f or 
the k parameters of interest is 

V = A~ 1 (conditional). (5.56) 

In particular, we often consider the best case scenario in which all parameters but one are 
supposed to be known exactly, say from independent observations or theoretical prejudice, 
and therefore k = 1. Then the la likelihood interval for the first parameter only is the square 
root of the covariance matrix element, and it is given by (all others fixed to their ML value) 

<ti = -J=. (5.57) 
v/n 

Estimation of k < p parameters, the others marginalized. 

Instead of fixing some parameters, we may prefer to disregard them completely, by integrating 
over them in order to obtain the marginalized likelihood in the k parameter of interest: 

L(t) oc / L(t,u)du, (5.58) 

with a suitable normalization constant so that the probability content of the marginalized 
LF is equal to unity. 

The marginal LF for t is still a multivariate Gaussian, with the same covariance matrix 
as the full LF, only with the last p — k rows and columns deleted: 

V ij =[F- 1 ].. 0<i,j<k (marginalized). (5.59) 

This result can be obtained by performing explicitly the integration (5.58) or more elegantly 
by using the properties of the characteristic function (Kendall & Stuart, 1977, Chap. 4, Vol.1). 
In terms of the splitting (5.54), the covariance matrix for the marginalized distribution is 

V = [A — GB _1 G T ]~ 1 . (5.60) 

Very often one quotes marginalized likelihood intervals for one parameter alone, k = 1 
with all other parameters marginalized, in which case the la error is given by 



a 1 = ^(F^) 11 . (5.61) 

If the parameters are uncorrelated, then F is diagonal, and fixing u or marginalizing over 
them is equivalent, otherwise the resulting likelihood intervals for the parameter(s) of interest 
are in general different, with the marginalized interval being broader. 



5.1.6 Gridding method 

In the numerical fit to the data, the shape of the LF is determined by evaluating the least- 
square estimator (5.26, page 103) at each point on a grid in the p dimensional parameter 
space and the minimization of the chi-square in the desired range of parameters gives the ML 
estimate. 
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1007% 

Likelihood content 


68.3% 95% 95.4% 99% 99.7% 
(la) (1.96a) (2a) (2.58a) (3a) 


1 parameter, 

2 parameters, Q 2 , 


1.00 3.84 4.00 6.63 9.00 
2.30 5.99 6.17 9.21 11.80 



Table 5.1: A% 2 = for marginalized likelihood intervals in one parameter (k = 1) or 
marginalized likelihood contours in two parameters (k = 2) for the given joint likelihood 
content. 



Assuming that the measurements are normally distributed around their true value we 
have 

L(d|0) = L max exp [-x 2 (0)/2] . (5.62) 

Prom this we can use the above prescriptions to determine likelihood or confidence intervals 
from real data. 

In the frequentist analysis, the boundaries of the confidence regions represent exclusion 
plots at the given confidence level: they are found as the contours of constant x 2 using the 
relation (5.31, page 105), independently of the value of the chi-square at the ML point. In 
Bayesian statistics, the likelihood intervals are instead drawn around the ML point, hence 
their extension depends on the best fit value. This applies only to the gridding method, not to 
the Monte Carlo sampling described below in § 5.1.7. It is customarily to quote marginalized 
likelihood intervals for one parameter only or to plot two-dimensional likelihood contours to 
show degenerate direction between two parameters (also see below the paragraph discussing 
the maximization approach instead of marginalization); for these two cases, the cook-book 
prescription for Bayesian (Maximum Likelihood) statistics on a grid of samples in parameter 
space is: 

• find the ML point L max in the grid of parameters by minimizing the \ 2 of Eq. (5.30, 
page 104) and mark this point as Xmim y° ur least-square estimate of the best fit; 

• determine the boundaries of the region containing 1007% °f likelihood as the values of 
the parameters for which the x 2 has increased by an amount Ax 2 = Q 1 ^ (k = 1,2 the 
number of parameters considered) with respect to Xmin- 

• The values of can be found for every desired likelihood content using the relation, 
cf. (5.31, page 105) 

rQ k 

7 = y 1 V x 2 k {u)du. (5.63) 

Table 5.1 displays the values of Ax 2 for k = 1,2 and for some popular choices of 
likelihood content. 

In a real situation, the LF computed using (5.62) will not be exactly a multivariate 
Gaussian, and the likelihood intervals obtained with this method will only approximatively 
encompasses the stated probability content. There are methods which improve on the as- 
sumption of a normal distribution presented here, see for instance Bond et al. (2000); Bartlett 
et al. (2000); Wandelt et al. (2001); Jaffe et al. (2003). 
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Finally, notice that likelihood (Bayesian) contours are usually much tighter than the 
confidence contours drawn from the frequentist point of view. This is a consequence of the ML 
point having often a x 2 / f much smaller than 1, because the data-sets are highly consistent 
with each other and also because usually not all points are completely independent. For the 
CMB, this was the case when one considered a combination of several data-sets before WMAP, 
as we discuss in § 7.2. If we consider the usual situation in which likelihood contours are 
drawn in a two dimensional plane with all other parameters marginalized over, the frequentist 
approach is more conservative than Bayesian statistics: the region corresponding to the 
desired confidence level (frequentist) or likelihood content (Bayesian) 7, has bounds given 
by X 2 (0) — Qfej with k = 2 for Bayesian statistics and two-dimensional plots, and k = 
f for frequentist statistics independently on the number of parameters considered. Since 
in general and for reasonably good ML values Xmm^®(f) an< ^ / > 2, we have that the 
probability/likelihood content is the same, i.e. 

/ V Y 2(u)du= / V Y 2(u)du (5.64) 
Jq] f JQl 2 

only for Q'j > Q\. When looking at Bayesian likelihood contours one should thus keep 
in mind that a point more than, say, 3<7 away from the ML point is not necessarily ruled 
out by data. In order to establish this, one has to look at confidence contours, i.e. ask the 
frequentist 's question. This is pointed out in a penetrating way by Gawiser (2001). 



Maximization instead of marginalization 

In practical applications, involving up to a dozen parameters, it is an exceptionally demanding 
task to perform the multidimensional integral of Eq. (5.58). A computationally more feasible 
alternative which avoids the time consuming integration is to maximize the parameters we 
are not interested in, u, for each value of the parameters of interest, t, obtaining 

L(t) oc maxL(t, u) . (5.65) 

u 

If the distribution is Gaussian, then the two procedures give the same result: maximizing 
L(t, u) corresponds to minimization over u of the quadratic form 6 T C~ 1 9, with the notations 
of (5.53) and (5.54). Differentiating with respect to u, we find that the minimum of the 
quadratic form lies at 

u = -B" 1 G T t, (5.66) 

and therefore 

L(t)ocexp--t T [A-GB _1 G T ]t, (5.67) 

which is the same result we found by marginalizing over u, Eq. (5.60). Numerical investiga- 
tions have found that maximization tends to underestimate errors when the assumption of a 
Gaussian distribution is not accurately fulfilled (Efstathiou et al., 1999). 



5.1.7 Markov chain Monte Carlo 

A big practical limitation to grid based parameter extraction techniques is that the number 
of CMB spectra needed scales exponentially with the dimensionality of the parameter space 
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considered. Even with fast parallel computing, the required computational time quickly be- 
comes very large, even for a moderate number of points in each dimension. Interpolation 
algorithms and other optimization techniques have been employed to circumvent this funda- 
mental limitation, allowing the handling of up to a dozen parameters (Tegmark et al., 2001). 
Nevertheless, this method shows a lack of flexibility if one wants to add new data-sets or 
incorporate new parameters or theoretical priors. At the latest with the coming of WMAP 
data, the days of grid-based parameter extraction seem to be over, since the accuracy of 
WMAP-like data cannot be exploited with the insufficient resolution and flexibility offered 
by this technique. 

Markov chain Monte Carlo (hereafter MCMC) methods are now becoming the standard 
tool to determine parameters from CMB data, combine it with large scale structure con- 
straints or investigate the effect of different priors. As advocated e.g. by Christensen et al. 
(2001), MCMC is a method to generate a sequence of (correlated) samples, called a Markov 
chain, from the posterior pdf of the parameters given the data, (5.33, page 106). The great 
advantages are that the computational time scales approximately linearly with the number of 
dimensions of the parameter space, and that once the chain has properly converged (see be- 
low for more details), the marginalized posterior distribution for the parameter (s) of interest 
can be simply recovered by plotting histograms of the sample list, thus avoiding completely 
the costly integration. It is easy to adjust the prior information or to include new data-sets 
into an existing chain without having to recompute it, with a procedure called "importance 
sampling". 

One can think of the MCMC algorithm as an efficient integration technique to evaluate the 
posterior distribution in Bayes' Theorem, Eq. (5.33, page 106). The Monte Carlo sampling 
does not rely on the assumption of Gaussian pdf 's: indeed, the direct sampling of the posterior 
permits to reveal features due to its non-Gaussian distribution, and therefore vastly improves 
on the methods based on chi-square goodness-of-fit described above. Besides those undeniable 
advantages over the grid method, the popularity of MCMC in the cosmology community has 
been boosted by the timely public release of the COSMOMC package (Lewis & Bridle, 2002), 
which integrates the code CAMB for the computation of the CMB power spectra 4 and several 
useful tools for the generation and interpretation of Markov chains using CMB and other 
cosmological data-sets. Further details about MCMC methods can be found e.g. in Gilks 
et al. (1996); MacKay (2003). 

The Metropolis-Hastings algorithm (Metropolis et al., 1953; Hastings, 1970) is the core of 
the sample generation, and produces a Markov chain whose equilibrium distribution is the 
target probability density, here the posterior V(6\d). The chain is started from a random 
point in parameter space, 6q, and a new point Q\ is proposed with an arbitrarily proposal 
density distribution q(0 n ,6 n+ i). The transition kernel T(6 n ,0 n+ i) gives the conditional 
probability for the chain to move from n to 9 n +\, and it must satisfy the "detailed balance" 

V(O n+1 \d)T(0 n+1 ,0 n ) = V(0 n \d)T(0 n , G n+1 ) (5.68) 

so that the posterior V(0\d) is the stationary distribution of the chain. This is achieved by 



4 Both codes are available at: http://cosmologist.info. 
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defining the transition kernel as 



(5.69) 




} 



(5.70) 



where a(6 n , 6 n+ \) gives the probability that the new point is accepted. Since V(9\d) oc 
L{d\6)V{6) and for the usual case of a symmetric proposal density, q(8 n ,6 n+ i) = q(6 n+ i,9 n ), 
the new step is always accepted if it improves on the posterior, otherwise it is accepted with 
probability L(d\0 n+1 )V(6 n+1 )/L(d\0 n )V(6 n ). 

The result is a sample list from the target distribution, from which all the statistical 
quantities of interest can readily be evaluated. The samples are correlated with each other, 
a fact which does not constitute a problem for the statistical inference on the parameters; 
however, importance sampling does require uncorrelated samples, which can be obtained from 
the original chain by suitably "thinning" the chain, i.e. by retaining only one sample every 
N, with N of the order of a few thousands. Other important practical issues in working with 
MCMC methods involve: 

• Burn in period: the initial samples need to be discarded, since the chain is not yet 
sampling from the equilibrium distribution. The burn in can roughly be assessed by 
looking at the evolution of the posterior and at the position of the chain in parameter 
space as a function of the step number. When the chain is started at a random point of 
the parameter space, the logarithm of the posterior pdf is large (and thus the posterior 
probability is small), and becomes smaller at every step as the chain approaches the 
region where the fit to the data is better. Only when the chain has moved in the 
neighborhood of the ML point the curve of the log posterior as a function of the step 
number flattens around the best fit value. This is illustrated in the left panel of Fig. 5.1. 
Another useful diagnostic is the evolution in parameter space of multiple chains, which 
are started from different points. In a well-behaved situation all of the chains converge 
after the burn-in period to the same region around the ML point, see the right panel 
of Fig. 5.1 for an illustration. 

• Convergence: assessing convergence of the chain essentially means to know when 
we can stop, having gathered a number of samples large enough to correctly derive the 
statistical quantities of interest. This is in general a difficult question, see e.g. Cowles & 
Carlin (1996); Mengersen et al. (1999) and references therein. The COSMOMC package 
offers several useful diagnostic tools, including the Raftery &; Lewis (1996) statistics 
and the Gelman & Rubin (1992b) criterion. 

• Multiple chains: there is a debate among experts about the best strategy between 
having one long chain or rather several shorter ones running in parallel, see e.g. Gelman 
& Rubin (1992a,b); Raftery k, Lewis (1996). Multiple independent chains offer the 
advantage of being computed in parallel, and can be started in different points of 
the parameter space to ensure good mixing, i.e. an adequate exploration of the whole 
parameter space. 

• Starting points: after the burn in period, the converged chains do not depend on the 
initial starting points. However, it is convenient to start the chains in the proximity of 
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Figure 5.1: Illustration of the burn-in period. Left panel: the logarithm of the (non- 
normalized) posterior, — lnV(0\d), as a function of the step number for four Monte Carlo 
chains. After the burn-in period (dotted, vertical lines), the value flattens and the chains are 
sampling from the target distribution. Right panel: the four chains (in different colors) are 
started in different points of a 6-dimensional parameter space and all converge to the same 
region after the burn-in. The vertical axis gives the number of steps. 

the parameter region where the best fit is supposedly located, so that convergence will 
be quickly achieved, and the sophisticated choice of the starting points proposed by 
Gelman & Rubin (1992b) is usually not necessary in cosmological applications. Also 
one has to take into account the fact that the MCMC is a local algorithm, which can 
be trapped inside local minima far away from the global minimum of the posterior, an 
issue which is intimately related with the choice of the proposal density. The use of 
simulated annealing algorithm via the introduction of a finite temperature for the MC 
can sometimes help in achieving convergence in a weird-shaped parameter space. 

• Proposal density: the optimal choice of the proposal density is the key parameter 
for an efficient implementation of the MCMC method. A simple possibility for the pro- 
posal density q(0 n ,6 n+ i) is a Gaussian with step size along the parameter direction 
i, independently on the chain position. Finding the optimal value of Si is a trade-off be- 
tween a large step size, which will result in almost all step being rejected and therefore 
in low efficiency, and a too small value, for which the chain performs a random walk 
and the tails of the distribution will not be adequately sampled, giving serious underes- 
timate of the likelihood intervals for the parameters. One can also roughly sample the 
distribution with a short chain, construct from the samples the covariance matrix of 
the posterior distribution and use this information to construct a new parameter basis 
approximately aligned with the degeneracy directions (Lewis & Bridle, 2002), which 
ensures a more efficient exploration. A sampling method which exploits the known 
degeneracies of the CMB and uses normal parameters as basis has been proposed by 
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Slosar & Hobson (2003), and it can dramatically enhance the efficiency of the MCMC 
algorithm, especially for large data-sets as the one expected for the Planck satellite. 

5.2 Fisher matrix forecasts 

An important issue is to assess quantitatively the expected performance of future CMB ex- 
periments in terms of the precision reached in the determination of cosmological parameters: 
this helps in understanding whether an observed degeneracy is a consequence of the lack of 
precision in the data, or else it is of fundamental nature and will not be lifted by upcoming 
or even ideal (i.e. cosmic variance limited) measurements; it also gives estimates of the neces- 
sary instrumental characteristics to achieve a certain precision, and on the optimal observing 
strategies, e.g. full sky coverage versus high resolution mapping of a patch only. 

It is possible and indeed necessary at the development stage of a CMB experiment to 
investigate in detail the above questions by producing mock realizations of the CMB sky and 
run Monte Carlo simulations of the observations. From the theorist's point of view, however, 
it is often sufficient and preferable to resort to a simpler alternative, which gives quantitative 
and accurate results with very small computational requirements: a Fisher matrix analysis 
(FMA) (Knox, 1995; Kosowsky et al., 1996; Tegmark et al., 1997; Zaldarriaga et al., 1997; 
Bond et al, 1997; Eisenstein et al., 1998b; Efstathiou & Bond, 1999; Tegmark et al., 2000). 

5.2.1 Experimental parameters 

As explained in § 5.1.5, if the LF is a multivariate Gaussian then the Fisher information 
matrix defined in Eq. (5.41) is an estimate of the inverse of the covariance matrix for the 
parameters under scrutiny. Since any LF can be expanded up to second order in the vicinity 
of the ML point as in (5.40), the goal is to compute the Fisher matrix for the CMB power 
spectrum, including the noise of the future experiment, and estimate from it the covariance 
matrix using the results for Bayesian statistics presented in § 5.1.5. 

The estimator (5.15) for the CMB temperature power spectrum (below we generalize the 
result to include polarization information as well, § 5.2.2) needs to be modified to subtract 
off the noise contribution and correct for the fact that the measured aim's are a smeared out 
version of the true ones, resulting from the convolution of the signal with the experimental 
beam, giving (Knox, 1995; Bond et al., 1997) 

V m=-£ ) 

In the above expression, the two experimental parameters are the inverse weight per solid 
angle Wb, which accounts for the experimental noise, and the beam width £ b , which corrects 
the smoothing due to the Gaussian profile of the beam. These two parameters are written in 
terms of the fundamental specifications of the experiments, namely the rms pixel noise (or 
sensitivity per resolution element) a b and the angular resolution 6b (FWHM) expressed in 
degrees as 

vj- 1 = (a b 6 b ) 2 and £ b = V81n2/0 b . (5.72) 

In the limit of infinite resolution, 9b — ► 0, and no experimental noise, Ob — ► 0, we recover the 
cosmic variance limited estimator (5.15). 
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As in § 5.1.2, we can now find the pdf for (5.71), 

v{Cl) = ^vw^ i'^w J ' (5J3) 

recalling Z = 2£ + 1 and the chi-square distribution displayed in Eq. (5.17). The correction 
for the noise and the beam size makes this estimator biased, i.e. 

(C e ) = C e + , (5.74) 

which is exactly what we need to compensate for the experimental noise and beam width. 
From this it follows from (5.23) and (5.38) that the log-normal LF has the form 



hL[C e (0)+wzW +1 V4) + 



V(M-i)/*g 



(5.75) 



and we have dropped several normalization factors which do not depend on 6. Using (5.74) 
we then obtain for the Fisher information matrix defined in (5.41) 







where the quantity (ACg) 2 is the standard deviation on the estimate of CV, and takes into 
account both the cosmic variance and the experimental error, 

(AQ) 2 = + w- h l e l ^l) 2 . (5.77) 

The sum over multipoles runs over the multipole coverage of the experiment, between £ m ; n 
and l max . 

Thus once the experimental parameters are specified, the computation of the Fisher ma- 
trix only requires the knowledge of the derivatives of the power spectrum with respect to the 
cosmological parameters. The derivatives are determined numerically as double sided deriva- 
tives, see § 5.2.3, and this requires the computation of 2p + 1 spectra only for p parameters, 
which is a very small computational effort compared with the full numerical exploration of 
the likelihood surface. 



5.2.2 Generalizations 

In this section, we develop the necessary general machinery which refines the above results 
including a more detailed experimental parametrization and polarization information. 

Most experiments present several frequency channels, each of them characterized by its 

TP 'TP 

own sensitivity a c ' and angular resolution C ' , both for temperature (T) and E-polarization 
(P). Furthermore, even full-sky experiments only cover a fraction of the sky, since point 
source subtraction, foreground removal and galactic plane cuts have to be performed on 
the full-sky maps. This can be approximately taken into account by assigning a "clean" 
fraction / s k y to the experimental coverage. These factors are accounted for by generalizing 
the expression (5.77) to (Efstathiou & Bond, 1999) 

< ac <> 2 = wtW q+b " )2 ' (5 ' 78) 
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where the inverse noise term is given by 

Bf = 2> c e-^ +1 )/^ (5.79) 

c 

and w c ,£ c are given by (5.72) for each channel c. 

In the more general case, we also want to include E polarization and temperature- 
polarization correlation (C) along with temperature information: then instead of a single 
derivative we have a vector of three derivatives with the weighting given by the the inverse 
of the covariance matrix of the spectra, and the Fisher matrix is given by (Zaldarriaga & 
Seljak, 1997), 

^EE>-(Wf . (5.80) 



where Cov -1 is the inverse of the covariance matrix for the spectra evaluated at the ML 
point 6* , Oi are the cosmological parameters we want to estimate and X, Y stands for T 
(temperature), E (polarization mode), or C (cross-correlation of the power spectra for T and 
E). 

For each t one has to invert the covariance matrix and sum over X and Y. The diagonal 
terms of the covariance matrix between the different estimators are given by 

Cov(C^) = (2£+ 2 1)/sky (^ + ^ 2 ) 2 (5-81) 

C ° V( ^ £) = (2£+\)f sky {CEe + B ^ )2 (5 - 82) 

Cov ( C Ct) = ( 2i + 1 i)/ sky i C ct + + B^)(Cei + B~l)] , (5.83) 
and the off diagonal terms are 

Cov(C T ^) = (5.84) 

Cov{C T iC C t) = {2e+ \ )fk Cci(Cti + B-*) (5.85) 

Cav{C El C c i) = {2l + 2 1)/&k CctiC E i + B-\) , (5.86) 

where B^ = BJ 2 given in Eq. (5.79) and Bp £ is obtained using a similar expression but 
with the experimental specifications for the polarization channels. 

5.2.3 Accuracy issues 

The accuracy of the Fisher matrix predictions for the errors depends on a number of issues: 

• The FMA assumes that the true values of the parameters are in the vicinity of the ML 
point 6* . The validity of the results therefore depends on this assumption, as well as 
on the assumption that the aim's are independent Gaussian random variables. 

• This is a local method based on a quadratic expansion of the LF. Only if the FMA 
predicted errors are small enough, the method is self-consistent and we can expect the 
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FMA prediction to correctly reproduce the exact behavior, and in particular the cor- 
relations between parameters, thus revealing the degeneracy directions. The expansion 
up to second order is exact if the dependence of the Ce on the parameters is linear, 
therefore great importance is attached to the choice of the parameter set with respect 
to the FMA is performed. As shown in Kosowsky et al. (2002), employing the normal 
parameters set discussed in § 4.2 as a base, the accuracy of the FMA predictions is 
greatly enhanced. This is because the spectra are almost linear in the normal parame- 
ters in the vicinity of the best fit. 

• Special care must be taken when computing the derivatives of the power spectrum with 
respect to the cosmological parameters. This differentiation strongly amplifies any 
numerical errors in the spectra, leading to larger derivatives, which would artificially 
break degeneracies among parameters. Double-sided derivatives reduce the truncation 
error from second order to third order terms, but the correct choice of the step size is 
a trade-off between truncation error and numerical inaccuracy dominated cases (Press 
et al., 1992). 

5.3 CMB observations: a brief historical account 

The experimental status of CMB observations has made giant leaps over the last ten years, 
thanks to spectacular advancements in detector technology. As demonstrated in Chapter 6, 
CMB data nowadays provide stringent tests which severely constrain cosmological model 
building, and call for more refined theoretical and computational approaches which take into 
account subtle physical effects which were so far ignored or thought to be irrelevant. Here 
we provide a personal selection of a few milestones of this development, in order to put the 
current and future experimental achievements into a wider perspective. 

The first detection of temperature anisotropy came in 1992 with the Differential Microwave 
Radiometer (DMR) aboard the COBE satellite after one year of observations on angular 
scales larger than 7° (Smoot et al., 1992; Wright et al., 1992) or multipoles ^20. The key 
results of the full four year DMR observations are summarized in Bennett et al. (1996, see 
references therein): the quadrupole amplitude was measured for the first time, the spectral 
tilt of the large scale spectrum was found to be compatible with an Harrison-Zel'dovich 
spectrum and no evidence of non-Gaussianity of the fluctuations was discovered in the data. 
The FIRAS instrument was devoted to the study of the CMB spectrum (Fixsen et al., 1996), 
and obtained a precision measurement of its temperature (T = 2.728 ± 0.002 K), while 
constraining deviations from a perfect black body spectrum to be less than about one part 
in 10 5 with 95% confidence. 

The Saskatoon and Toco data provided the first hint for the presence of the first adiabatic 
peak (Netterfield et al., 1997; Miller et al., 1999; Knox & Page, 2000), but at the turning 
of the millennium several groups independently reported measurements of the temperature 
anisotropy with a resolution of a few arcminutes, sufficient to unambiguously reveal the first 
peak and start exploring the subsequent ones: BOOMERanG (de Bernardis et al., 2002; Net- 
terfield et al., 2002) and Maxima (Hanany et al., 2000; Lee et al., 2001), both balloon-borne 
bolometric experiments, mapped the multipole region 80<^<1000; the CBI (Padin et al., 
2001) and DASI (Halverson et al., 2002) ground based interferometers covered a similar mul- 
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tipole range but with a completely different technology, which had the advantage of being free 
from the calibration uncertainty of bolometric receivers. The Archeops experiment (Benoit 
et al., 2003a), conceived as a balloon-borne precursor of the HFI bolometric instrument for 
the Planck satellite, observed a larger portion of the sky, and thus provided an estimation of 
the temperature power spectrum which for the first time encompassed the first peak region 
and also partially overlapped with the COBE measurement, in the range 15 < £ < 350. Given 
the experimental calibration uncertainty of the bolometers, which is about 10 — 20%, this 
permits to test the relative calibration between COBE and the other experiments with data 
in the t > 50 region, and perform a comparison of the height of the first peak with respect to 
the large scale plateau. All of this data generally agrees well on the position and shape of the 
first peak, but their resolution is insufficient to permit the reconstruction of the subsequent 
ones with high confidence (de Bernardis et al., 2002; Durrer et al., 2003b). 

From the point of view of parameter extraction, each of the above data sets by its own 
as well as their combination leads to a broad agreement of an approximately flat Sl tot ~ 1 
universe with scale invariant spectral index n s ~ 1, with the la likelihood intervals being 
of the order of 10% and somewhat depending on the compilation of data and on the prior 
assumed (Stompor et al., 2001; Lange et al., 2001; Pryke et al., 2002; Netterfield et al., 
2002). The estimation of the baryon density proved to be more controversial, because of 
discrepancies and a lack of resolution at the level of the second and third peak: in particular, 
the BOOMERanG 1998 and MAXIMA data seem to favor a baryon content about 50% 
larger than predicted by BBN, around fi^/i 2 ~ 0.03 (Tegmark k, Zaldarriaga, 2000; Lange 
et al., 2001; Stompor et al., 2001), a discrepancy which disappears with the improved beam 
reconstruction of the BOOMERanG 2000 observations (Netterfield et al., 2002). Inclusion of 
supernova? data or the Hubble Space Telescope prior for the Hubble constant, together with 
the flatness determination, points toward a universe dominated by a cosmological constant. 

Before the WMAP satellite delivered its results, ground based instruments pressed on 
and opened up two new observational directions: very small scale observations (4' — 5') 
and E-polarization detection. The CBI interferometer, in two different configurations called 
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Figure 5.2: The small scale temperature angular power spectrum observed by CBI "mosaic" 
during two years and by ACBAR. The shaded region shows the excess power at small scale, 
compatible with the SZ effect. Reprinted from Readhead et al. (2004). 



5.3 CMB observations: a brief historical account 



121 




Figure 5.3: The spectacular increase of the accuracy of CMB observations: in the left panel, 
a compilation of pre-WMAP temperature power spectrum measurements obtained between 
1996 (COBE) and 2003 (CBI) is compared with the WMAP first year data in the right panel, 
released in February, 2003. The error-bars give the la uncertainty due to the measurement 
errors, while the shaded region represent the cosmic variance limit. Both figures reprinted 
from Hinshaw et al. (2003a). 

"mosaic" and "deep field", obtained measurements of the temperature power spectrum up to 
£ = 3500 (Sievers et al., 2003; Mason et al., 2003), and it was argued that the excess power 
observed at high multipoles could be due to the SZ effect, from which a precise determination 
of as could possibly be obtained (Bond et al., 2002). The ACBAR experiment, a bolometric 
instrument installed at the South Pole, found small scale power consistent with the results of 
CBI, without however being able to place tighter constraints on its origin (Goldstein et al., 
2003; Kuo et al., 2004). More recently, the results of two years of observations with the 
CBI "mosaic" configuration, give smaller errors in the I ~ 2000 region, due to the longer 
integration time and to an improved absolute calibration derived from the WMAP data, see 
Fig. 5.2. Beside revealing effects due to secondary anisotropies as the SZ effect, the small 
scale measurements are helpful in better constraining n S) T re and possible features in the 
power spectrum (like a "running", i.e. a scale dependence of ra s ) because of the larger lever 
arm they offer when combined with WMAP and large scale structure data (Readhead et al., 
2004). 

The DASI interferometer reported in the second half of 2002 the first detection of E- 
polarization, which was observed on degree angular scales with almost 5a confidence (Kovac 
et al., 2002), thereby opening the epoch of polarization measurements. 

The first year WMAP data, unveiled in February 2003 (Bennett et al., 2003; Hinshaw 
et al., 2003a), essentially confirmed the picture which had emerged from pre-WMAP obser- 
vations, see Fig. 5.3: the height of the first peak was corrected by about 10%, showing more 
power than in the previous data, while the large scale spectrum confirmed the DMR results. 
The second peak is now accurately outlined, while the full four years data should allow to 
obtain good resolution up to I ~ 1000 in temperature. The low power of the quadrupole 
remains troublesome, since it is still not clear whether it is pointing to new physics or just a 
consequence of systematical errors. The observation of the temperature-polarization correla- 
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tion up to i ~ 500 (Kogut et al., 2003) has proved very useful in order to better constrain 
parameters. The exquisite quality of the power spectra has tightened the la likelihood inter- 
vals to a few percent for most cosmological parameters (Spergel et al., 2003), and the central 
value has remained in the region preferred by earlier data, with two interesting exceptions: 
the TE data favor a much larger reionization optical depth than previously thought, and 
there seems to be a slight preference for a "running" (i.e. scale dependent) spectral index 
(Peiris et al., 2003). 

A complete overview of the evolution of data and of the cosmological parameters derived 
from it can be found in the review by Bond et al. (2003). 



Chapter 6 

Beyond standard parameters 



This chapter is devoted to the investigation of three scenarios involving non-standard cosmo- 
logical parameters, and focuses on the ability of constraining them using present and future 
CMB observations: the existence of extra relativistic particles (§ 6.1); the determination of 
the primordial helium mass fraction (§ 6.2); and possible time variations of the fine structure 
constant (§ 6.3). 

Until recently, the effects induced by these parameters on the CMB where considered too 
small to be observable, or else irrelevant; however, the era of precision cosmology that we 
are entering requires on one hand that we check the consequences of our assumptions on the 
standard results for other parameters (as in the case of the neutrino families and the helium 
fraction); on the other hand, it allows us to put under close scrutiny very subtle effects which 
could previously be safely neglected because of the less accuracy of the data sets. 

6.1 Extra relativistic particles 

This section is based on the work published in Bowen et al. (2002), which was carried out 
for the most part during my stay in Oxford. We investigate one possible modification to 
the standard scenario, namely variations in the parameter io ie \ = £l ie \h 2 which describes the 
energy density of relativistic particles. The original work has been performed in 2001, and 
therefore the results presented here of the pre- WMAP data analysis are nowadays somewhat 
outdated. However, the focus is on the degeneracies involving ui re \ and as such the conclusions 
drawn are still valid. Furthermore, the subsequent analysis by several groups of the actual 
WMAP data permits a comparison between the forecasts obtained with the Fisher matrix 
technique in 2001 and the real case, showing a very satisfactory agreement and validating 
the method used. 

After offering the motivations for our study in § 6.1.1, we review various physical mecha- 
nisms that can lead to a change in lo tc \ with respect to the standard value in § 6.1.2. In § 6.1.3, 
we illustrate how the CMB angular power spectrum depends on this parameter and identify 
possible degeneracies with other parameters, then present in § 6.1.4 a likelihood analysis 
from pre- WMAP CMB data and show which of the constraints on the various parameters 
are affected by variations in u> m \. Section 6.1.5 forecasts the precision in the estimation of 
cosmological parameters for the space missions WMAP and Planck, and then compares the 
predictions with actual data analysis performed on the first year WMAP data. 
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6.1.1 Motivation 

CMB data analysis taking into account variations in the density of relativistic particles has 
been previously undertaken by many authors (Hannestad, 2000; Esposito et al., 2001; Kneller 
et al., 2001; Hannestad, 2001; Hansen et al., 2002; Zentner k Walker, 2002), giving rather 
crude upper bounds, which are significantly improved only by including priors on the age 
of the universe or by including supernovae (SN) or large scale structure (LSS) data. It is 
worth emphasizing that there is little difference in the bounds on N e g, the effective number 
of relativistic species, obtained from old and recent CMB data because of the degeneracy 
described in detail below. We focus here on the effects that the inclusion of this parameter, 
a; re i, has on the constraints of the remaining parameters in the context of purely adiabatic 
models. 

As shown below - and as observed previously, see e.g. Hu et al. (1999) - there is a strong 
degeneracy between lo ic \ and the physical density of non-relativistic matter, u m = Q, m h 2 . 
This is important, because an accurate determination of uj m from CMB observations (and of 
Q m by including the Hubble Space Telescope result h = 0.72 ± 0.08) can be useful for a large 
number of reasons. First of all, determining the cold dark matter content, to c dm = u m — uJb can 
shed new light on the nature of dark matter. The thermally averaged product of cross-section 
and thermal velocity of the dark matter candidate is related to u m , and this relation can be 
used to analyze the implications for the mass spectra in versions of the Supersymmetric 
Standard Model, see e.g. Barger & Kao (2001); Djouadi et al. (2001); Ellis et al. (2001). 
The value of J7 m can be determined in an independent way from the mass-to-light ratios of 
clusters, and the present value is 0.1 < f] m < 0.2 (Carlberg et al., 1997; Bahcall et al., 2000). 
Furthermore, a precise measurement of O m will be a key input for determining the redshift 
evolution of the equation of state parameter w(z) and thus discriminating between different 
quintessential scenarios, see e.g. Weller & Albrecht (2002). 

6.1.2 Effective number of relativistic species 

The energy density of relativistic particles can conveniently be parameterized via the effective 
number of relativistic species, N e s: in the standard model co Te \ includes photons and neutrinos, 
and it can be expressed as 



where w 7 is the energy density in photons and uj u is the energy density in one active neutrino 
family. In geometrical units, where G = h = c = 1, one has uj x = 47r 3 /45 • g x T^ 7 where 
g x and T x are the relativistic degrees of freedom and the temperature of species x = 7, v, 
respectively. Measuring u; re i thus gives a direct observation of the effective number of neu- 
trinos, N e g. Naturally there are only three active neutrinos, and N e Q is simply a convenient 
parametrization for the extra possible relativistic degrees of freedom 



Thus ui IC \ includes energy density from all the relativistic particles: photons, neutrinos, and 
additional hypothetical relativistic particles such as a light majoron or a sterile neutrino. 
Such hypothetical relativistic particles are strongly constrained from standard Big-Bang nu- 
cleosynthesis (BBN), where the allowed extra relativistic degrees of freedom typically are 



(6.1) 



N cS = 3 + AN. 



(6.2) 
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expressed through the effective number of neutrinos, N e s = 3 + AAbbn- BBN bounds are 
typically about AtVbbn < 0.2 - 1.0 (Buries et al., 1999; Lisi et al., 1999). 

One should, however, be careful when comparing the effective number of neutrino degrees 
of freedom at the time of BBN (neutrino decoupling) and at the formation of the CMBR 
(photon decoupling). This is because the energy density in relativistic species may change 
from the time of BBN (T ~ MeV) to the time of last rescattering (T ~ eV), as explained 
in Hansen et al. (2002). For instance, if one of the active neutrinos has a mass in the 
range eV < m < MeV and decays into sterile particles such as other neutrinos, majorons 
etc. with lifetime t(BBN) < r < i(CMBR), then the effective number of neutrinos at 
CMBR would be substantially different from the number at BBN (White et al., 1995). Such 
massive active neutrinos, however, do not look very natural any longer in view of the recent 
experimental results on neutrino oscillations (Fogli et al., 2001; Gonzalez-Garcia et al., 2001), 
showing that all active neutrinos are likely to have masses smaller than 0.1 eV. One could 
instead consider sterile neutrinos mixed with active ones which could be produced in the 
early universe by scattering, and subsequently decay. The mixing angle must then be large 
enough to thermalize the sterile neutrinos, and this can be expressed through the sterile to 
active neutrino number density ratio n s jn v « 4-10 4 sin 2 26> (?n/keV)(10.75/<7*) 3 / 2 (Dolgov & 
Hansen, 2002), where 9 is the mixing angle, and g* counts the relativistic degrees of freedom, 
such that n s /n u = 1 or Ag* = 7/8 increases N e s by one unit. With n s /n u of order unity 
we use the decay time, r w 10 20 (keV/?n) 5 / sin 2 29 sec, and one finds, r w 10 17 (keV/m) 4 yr, 
which is much longer than the age of the universe for m ~ keV, so they would certainly 
not have decayed at t(CMBR). A sterile neutrino with a mass of a few MeV would seem to 
have the right decay time, r ~ 10 5 yr, but this is excluded by standard BBN considerations 
(Kolb et al., 1991; Dolgov et al., 1998). More inventive models with particles decaying during 
last rescattering cannot simply be treated with an Nqmb that is constant in time, see e.g. 
Kaplinghat et al. (1999), and we will not discuss such possibilities further here. 

Even though the simplest models predict that the relativistic degrees of freedom are the 
same at BBN and CMB times, one could construct models such as quintessence (Albrecht & 
Skordis, 2000; Skordis & Albrecht, 2002) which effectively could change AN between BBN 
and CMB (Bean et al., 2001). Naturally AN can be both positive and negative. For BBN, 
AN can be negative if the electron neutrinos have a non-zero chemical potential (Kang &: 
Steigman, 1992; Kneller et al., 2001), or more generally with a non-equilibrium electron 
neutrino distribution function (Hansen & Villante, 2000). To give an explicit (but highly 
exotic) example of a different number of relativistic degrees of freedom between BBN and 
CMB, one could consider the following scenario. Imagine another two sterile neutrinos, one 
of which is essentially massless and has a mixing angle with any of the active neutrinos just 
big enough to bring it into equilibrium in the early universe, and one with a mass of m Vs = 3 
MeV and decay time r„ s =0.1 sec, in the decay channel v s — ► v e + (ft, with (ft a light scalar. 
The resulting non-equilibrium electron neutrinos happen to exactly cancel the effect of the 
massless sterile state, and hence we have AAbbn = 0. However, for CMB the picture is much 
simpler, and we have just the stable sterile state and the majoron, hence AAcmb = 1-57. 
For CMB, one can imagine a negative AN from decaying particles, where the decay products 
are photons or electron/positrons which essentially increases the photon temperature relative 
to the neutrino temperature (Kaplinghat & Turner, 2001). Such a scenario also naturally 
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dilutes the baryon density, and the agreement on uij, from BBN and CMB gives a bound on 
how negative AiVcMB can be. Considering all these possibilities, we will therefore not make 
the usual assumption, ATVbbn = AJVcmb, but instead consider AiVcMB as a completely free 
parameter in the following analysis. 

The standard model value for N e g with three active neutrinos is 3.044. This small cor- 
rection arises from the combination of two effects arising around the temperature T ~ MeV. 
These effects are the finite temperature QED correction to the energy density of the elec- 
tromagnetic plasma (Heckler, 1994), which gives AN = 0.01 (Lopez & Turner, 1999; Lopez 
et al., 1999). If there are more relativistic species than active neutrinos, then this effect will 
be correspondingly higher (Steigman, 2001). The other effect comes from neutrinos sharing in 
the energy density of the annihilating electrons (Dicus et al., 1982), which gives AN = 0.034 
(Dolgov et al., 1997; Esposito et al., 2000c). Thus one finds iV cff = 3.044. An accurate 
analysis which takes into account both of this effects simultaneously has been performed by 
Mangano et al. (2002) and the result indicates that the combined effect is slightly smaller, 
JVefj = 3.0395. 



6.1.3 CMB theory and degeneracies 



As explained in detail in Chapter 4, the structure of the Ci spectrum depends on a restricted 
combination of cosmological parameters, which are physically probed by the CMB; simpli- 
fying somewhat the normal parameters set introduced in § 4.2, we focus here on the four 
cosmological parameters 



lo tc \ and 1Z' 



shift 



(6.3) 

= (^cdm + O b )/l 2 , 



the physical baryonic density ujf, = O^/i 2 , the energy density in matter u n 
the energy density in radiation ui re \ and the shift parameter 7£ shlft = £ Ie i/£, which gives the 
position of the acoustic peaks with respect to a flat, Oa = reference model, see Eq. (4.22, 
page 82). In previous analysis (Efstathiou & Bond, 1999; Melchiorri & Griffiths, 2001), the 
parameter ui Te \ has been kept fixed to the standard value, while here we will allow it to vary. 
It is therefore convenient to write 



uj rel = 4.13 • HT 5 (1 + 0.135 • AiVcMB) 



(6.4) 



(taking Tcmb = 2.726 K), where AA^cmb is the excess number of relativistic species with 
respect to the standard model, N e g = 3 + AA^cmb, and we drop the subscript CMB from 
now on. The shift parameter 7£ shlft depends on O m = O c dm + ^b, on the curvature K 
1 — — O m — f^reh and on f2 rc i = ^rcl/h? through 



shift 



1 



VL m x(At) 



Orel + 



On 



1 + Zde 



rel 



(6.5) 



where z^ cc is a function of the physical baryon density and x(Ar) is given in Eq. (1.33, page 
15). Eq. (6.5) generalizes the expression for ft shift given in (4.28, page 83) to the case of 
non-constant O re i. 

By fixing the four parameters given in (6.3), or equivalently the set uJb, the redshift of 
equality z, 



LL> m /uj Ic i, AN and lZ shlft , one obtains a perfect degeneracy for the CMB 
anisotropy power spectra on degree and sub-degree angular scales. On larger angular scales, 



eq 
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Figure 6.1: Left panel: CMB degeneracies between cosmological models. Keeping z eq ,u)b and 
7Z fixed while varying AN produces nearly degenerate power spectra. The reference model 
(solid line) has AiV = 0, Q t ot = 1-00, n s = 1.00; the nearly degenerate model (dotted) has 
AA = 10, f2 t ot — 1-05, n s = 1.00. The curves are normalized to the first peak. The position 
of the peaks is perfectly matched, only the relative height between the first and the other 
acoustic peaks is somewhat different in this extreme example, due to the early ISW effect. 
The degeneracy can be further improved, at least up to the third peak, by raising the spectral 
index to n s = 1.08 (dashed). Right panel: the matter power spectra of the models plotted in 
the top panel together with the observed decorrelated power spectrum from the PSCz survey 
(Hamilton & Tegmark, 2002). The geometrical degeneracy is now lifted. 



the degeneracy is broken by the late ISW effect because of the different curvature and cosmo- 
logical constant content of the models. From the practical point of view, however, it is still 
very difficult to break the degeneracy, since measurements are limited by cosmic variance on 
those scales, and because of the possible contribution of gravitational waves. 

Allowing AA" to vary, but keeping constant the other three parameters u>b, z CCL , and lZ shlit , 
we obtain nearly degenerate power spectra which we plot in Fig. 6.1, normalized to the 
first acoustic peak. The degeneracy in the acoustic peaks region is now slightly spoiled 
by the variation of the ratio fi 7 /fi re i: the different radiation content at decoupling induces 
a larger (for AiV > 0) early ISW effect, which boosts the height of the first peak with 
respect to the other acoustic peaks. Nevertheless, it is still impossible to distinguish between 
the different models with present (pre-WMAP) CMB measurements and without external 
priors. Furthermore, a slight change in the scalar spectral index, n s , can reproduce a perfect 
degeneracy up to the third peak. 

The main result is that, even with a measurement of the first three peaks in the angular 
spectrum, it is impossible to put bounds on uj rc \ alone, even when fixing other parameters 
such as u>b- Furthermore, since the degeneracy is mainly in z eq , the constraints on uj m from 
CMB are also affected, see § 6.1.4. 

In Fig. 6.2 we plot the shift parameter TZ shlit as a function of AA, while fixing Q m = 0.3 
and = 0.7. Increasing AN moves the peaks to smaller angular scales, even though the 
dependence of the shift parameter on AA is rather mild. In order to compensate this effect, 
one has to change the curvature by increasing fl m and Q\. We therefore conclude that the 
present bounds on the curvature of the universe are weakly affected by AN. Nevertheless, 
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Figure 6.2: The shift parameter 7£ shlft as a function of AiV with Qa = 0.7 and J7 m = 0.3. 
The position of the peaks is only weakly affected by AiV. 

when a positive (negative) AiV is included in the analysis, the preferred models are shifted 
toward closed (open) universes. 

6.1.4 Pre-WMAP constraints from CMB and other data-sets 

In this section, we compare pre- WMAP CMB observations with a set of models with cos- 
mological parameters sampled as follows: 0.1 < fi m < 1.0, 0.1 < n ro i/f2 re i(AiV = 0) < 3, 
0.015 < n b < 0.2; < Q A < 1-0 and 0.40 < h < 0.95. We vary the spectral index of 
the primordial density perturbations within the range n s = 0.50, 1.50 and we re-scale the 
fluctuation amplitude by a pre-f actor C\q, in units of C^ OBE . We also restrict our analysis 
to purely adiabatic, flat models (fitot = 1) an d we add an external Gaussian prior on the 
Hubble parameter h = 0.65 ± 0.2. 

Constraints from CMB only 

The theoretical models are computed using the publicly available CMBFAST program (Seljak 
& Zaldarriaga, 1996) and are compared with the BOOMERanG-98, DASI and MAXIMA- 
1 data. The power spectra from these experiments were estimated in 19, 9 and 13 bins 
respectively, spanning the range 25 < t < 1100. We approximate the experimental signal 
C e g inside the bin to be a Gaussian variable, and we compute the corresponding theoretical 
value C l g by convolving the spectra computed by cmbfast with the respective window 
functions. When the window functions are not available, as in the case of Boomerang-98, we 
use top-hat window functions. The likelihood for a given cosmological model is then given 
by 

C = ( C f - Cf)M BB , (C$ - Cf ) (6.6) 
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Figure 6.3: Two-dimensional likelihood plots from analysis of CMB data. 



where (C|f) is the theoretical (experimental) band power and M^b 1 is the Gaussian cur- 
vature of the likelihood matrix at the peak. This expression is a generalization of Eq. (5.30, 
page 104) for the case of correlated experimental points. We consider 10%, 4% and 4% Gaus- 
sian distributed calibration errors (in (j,K) for the BOOMERanG-98, DASI and MAXIMA-1 
experiments respectively. We also include the COBE data using Lloyd Knox's RADPack 
package (RADPack Website, 2001). 

In order to show the effect of the inclusion of u; re i on the estimation of the other parameters, 
we plot likelihood contours in the uj re \ — u m , lu tc \ — a;&, ui TC \ — n s planes. Proceeding as in 
Melchiorri et al. (2000), we calculate a likelihood contour in those planes by maximizing 
the other parameters as explained in § 5.1.5. In Fig. 6.3 we plot the likelihood contours 
for u) TC \ vs uJmjUJf, and n s . As can be seen, uj rc \ is very weakly constrained to be in the 
range 1 < u; rc i/u; rc i(A./V = 0) < 1.9 at la I.e. in all plots 1 . The degeneracy between uj rc \ 
and uj m is evident in the left panel of Fig. 6.3. Increasing oj {C \ shifts the epoch of matter- 
radiation equality and this can be compensated only by a corresponding increase in oj m . It is 
interesting to note that even if we are restricting our analysis to flat models, the degeneracy 
is still present and that the bounds on oj m are strongly affected. We find cj m = 0.2 ±0.1, to 
be compared with oj rn = 0.13 ±0.04 when AN is kept to zero. It is important to realize that 
these bounds on oj tc \ appear because of our prior on h and because we consider flat models. 
When one allows h and fl m to be free parameters, then the degeneracy is almost complete 
and there are no bounds on u) rc \. 

In the central and right panel of Fig. 6.3 we plot the likelihood contours for u>b and n s . As 
we can see, these parameters are not strongly affected by the inclusion of uj re \. The bound 
on u>b, in particular, is completely unaffected by u> re \. There is however, a small correlation 
between u; rc i and n s : the boost of the first peak induced by the ISW effect can be compensated 
(at least up to the third peak) by a small change in n s (right panel). 

Since the degeneracy is mainly in z eq , it is useful to estimate the constraints we can put 
on this variable. In Fig. 6.4 we plot the likelihood curve on z eq alone obtained by maximizing 
over all other parameters. By integration of this probability distribution function we obtain 

z eq = 3100+^ at 68% I.e. (6.7) 



1 Here as in the following, the abbreviation "I.e." stands for "likelihood content", in the Bayesian sense 
explained in § 5.1.5. 
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Figure 6.4: Likelihood probability distribution function for the redshift of equality. 
Adding other data-sets 

It is interesting to investigate how well the constraints from CMB-independent data-sets 
can break the degeneracy between uj re \ and u m . The supernovae luminosity distance is very 
weakly dependent on uj tc \ - see however Zentner & Walker (2002) - and the bounds obtained 
on f2 m can be used to break the CMB degeneracy. Including the SN-Ia constraints on the 
- plane, 0.8O m - 0.6O A = -0.2 ± 0.1 (Perlmutter et al., 1999), we find 



W rc l/Wrcl(AiV = 0) = 1.12 



0.35 
-0.42 



at 2cj% I.e. 



(6.* 



It is also worth including constraints from galaxy clustering and local cluster abundances. 
The degeneracy between uj m and w re i in the CMB cannot be broken trivially by inclusion of 
large-scale structure (LSS) data, because a similar degeneracy affects the LSS data as well 
(Hu et al., 1999). However, the geometrical degeneracy is lifted in the matter power spectrum, 
and accurate measurements of galaxy clustering at very large scales can distinguish between 
various models. This is exemplified in the right panel of Fig. 6.2, where we plot three matter 
power spectra with the same cosmological parameters as in the top panel, together with the 
decorrelated matter power spectrum obtained from the PSCz survey. 

The shape of the matter power spectrum in the linear regime for galaxy clustering can be 
characterized by the shape parameter 



r 



-(n 6 (i+V2Vn m )-o.o6) 



(6.9) 



V1 + 0-135AN ' 

From the observed data one has roughly (Bond & Jaffe, 1999) 0.15 < T + (n s — l)/2 < 0.3. 
The inclusion of this (conservative) value on T gives 

w rc i/u; r ci(AiV = 0) = 1.40^o 9 56 at 2a% I.e. (6.10) 



a bound which is less less restrictive than the one obtained using the SN-Ia prior. 

A better constraint can be obtained by including a prior on the variance of matter per- 
turbations over a sphere of size 8/i _1 Mpc, derived from cluster abundance observations. 
Comparing with ag = (0.55 ± 0.05)O~° , we obtain 



Wrcl/Wrcl(AiV = 0) = 1.27' 



0.35 
-0.43 



at 2cr% I.e. 



(6.11) 
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WrclMcl(AiV = 0) 



iVcff 



CMB only 
CMB + SN-Ia 
CMB + PSCz 
CMB + a 8 




0.04... 13.37 
0.78... 6.48 



1.81... 9.59 



1.82... 7.59 



Table 6.1: Data analysis results: 2a likelihood intervals on the effective energy density of rel- 
ativistic particles, w re i/w re i(AiV = 0), and on the corresponding effective number of neutrino 
species, N c s, for different data set combinations. Note that the bounds obtained with CMB 
data only mainly reflect the priors used in the analysis. 

Our results are summarized in Table 6.1. Combination of present day CMB data with 
SN and with LSS data yields a lower bound N c g > 0.8 and > 1.8, respectively, with 2a 
likelihood content. Our result is in good agreement with the analysis of Hannestad (2001), 
which considered similar data sets. It is worth emphasizing the fact that N c s = is excluded 
at much more than 2a: this can be considered as a strong cosmological evidence of the 
presence of a neutrino background, as predicted by the Standard Model. The upper bounds 
for the combined sets can be expressed as N c s < 6.5 for CMB+SN and iV c g < 9.6 for 
CMB+LSS, at 2a I.e. 

6.1.5 Fisher matrix forecast 

In this section we perform a Fisher matrix analysis with the technique explained is § 5.2 in 
order to estimate the precision with which forthcoming satellite experiments will be able to 
constrain the parameter z cq . 

Table 6.2 summarizes the experimental parameters for WMAP and Planck employed in 
the analysis, which considers temperature information only. For both experiment we take 
a sky coverage / s k y = 0.50. These values are indicative of the expected performance of the 
experimental apparatus, but the actual values may be somewhat different, especially for the 
Planck satellite. 

As base parameters for the Fisher matrix analysis, we use the following nine dimensional 
parameter set: 



to scalar ratio at the quadrupole. We adopt a phenomenological normalization parameter, 
given by 



so that Q effectively measures the mean power seen by the experiment. The shift parameter 
7£ shlft , including the radiation content as in Eq. (6.5) takes into account the geometrical 
degeneracy. Our purely adiabatic reference model has parameters: Uf, = 0.0200 (0& = 
0.0473), u c = 0.1067 (fi c = 0.2527), uj a = 0.2957 (fi A = 0.7000), (h = 0.65), K shm = 0.953, 
z eq = 3045, n s = 1.00, rit = 0.00 , r = 0.10, Q = 1.00. This is a fiducial, concordance model, 
which we believe to be in good agreement with most recent determinations of the cosmological 



= {u b ,uj c ,uj A ,TZ shlit ,z eq ,n s ,nt,r,Q} . (6.12) 
Here n s , n t are the scalar and tensor spectral indices respectively and r = Crf/Cf is the tensor 




(6.13) 
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WMAP 


Planck 


v (GHz) 
6 C (degrees) 

<r c /i(r 6 
w-710- 15 

tc 


40 60 90 
0.46 0.35 0.21 
6.6 12.1 25.5 
2.9 5.4 6.8 
289 385 642 


100 150 220 350 
0.18 0.13 0.09 0.08 
1.7 2.0 4.3 14.4 
0.028 0.022 0.047 0.44 
757 1012 1472 1619 


p 


1500 


2000 



Table 6.2: Experimental parameters used in the Fisher matrix analysis for WMAP (first 3 
channels) and Planck (last 4 channels). 

parameters (flat universe, scale invariant spectral index, BBN compatible baryon content, 
large cosmological constant). Furthermore, we allow for a modest, 10% tensor contribution 
at the quadrupole in order to be able to include tensor modes in the Fisher matrix analysis. 

We plot the derivatives of Ci with respect to the different parameters in Fig. 6.5. Generally, 
we note that derivatives with respect to the combination of parameters describing the matter 
content of the universe (w& and uj c , 1Z shlft , z cq ) are large in the acoustic peaks region, I > 100, 
while derivatives with respect to parameters describing the tensor contribution (nt, r) are 
important in the large angular scale region. Since measurements in this region are cosmic 
variance limited, we expect uncertainties in the latter set of parameters to be large regardless 
of the details of the experiment. The curve for dC^/dQ is of course identical to the Cg's 
themselves. The cosmological constant is a notable exception: variation in the value of u>\ 
keeping all other parameters fixed produces a perfect degeneracy in the acoustic peaks region. 
Therefore we expect the derivative dCt/duj\ to be zero in this region. Small numerical errors 
in the computation of the spectra, however, artificially spoil this degeneracy, erroneously 
leading to smaller predicted uncertainties. In order to suppress this effect, we set dCt/duj\ = 
for i > 200. From Eq. (5.76, page 117) we see that a large absolute value of dCi/d6i 
leads to a large Fa and therefore to a smaller la error (roughly neglecting non-diagonal 
contributions). If the derivative along 6{ can be approximated as a linear combination of the 
others, however, then the corresponding directions in parameter space will be degenerate, and 
the expected error will be important. This is the case for mild, featureless derivatives such 
as dCi/dr, while strongly varying derivatives (such as dCe/dTZ shlit ) induce smaller errors in 
the determination of the corresponding parameter. Therefore the choice of the parameter set 
is very important in order to correctly predict the standard errors of the experiment. 

Error forecast 

The quantity e,; = l/y/\, where A, is the z-th eigenvalue of the Fisher matrix, is sometimes 
used as a rough indication of the resolving power of an experiment. It expresses the accu- 
racy with which the i-th eigenvector of the Fisher matrix can be determined. The principal 
components describe to a good approximation which linear combinations of the cosmological 
parameters can be directly measured with the CMB. In fact, they represent linear approxi- 
mations to the orthogonal normal parameters introduced in § 4.2. For WMAP (Planck) the 
number of eigenvectors with 6j < 10~ 3 is 1 out of 9 (3 out of 9) and with ej < 10~ 2 is 3/9 
(6/9). 



6.1 Extra relativistic particles 



133 



Parameter 




WMAP 


Planck 


RpHshift of pnnalitv 

J- I V Vil li 1 1 -L KJ Ul V^VJ ll(A 1 1 Vt y 


fiz„~ / Z™ 


0.23 


0.02 


Relativistic energy 




0.43 


0.03 


Effective v families 


AAU- 

l — x± Y etc 


3.17 


0.24 


Baryons density 


5(jJh 1 LOh 

^ M 1 


0.12 


< 0.01 


CDM density 


5uj c /uj c 


0.50 


0.04 


Cosmological constant 


5lj a /uj a 


3.40 


1.71 


Shift parameter 


5K shiit 


< 0.01 


< 0.01 


Scalar spectral index 


5n s 


0.15 


0.01 


Tensor spectral index 


5n t 


1.96 


1.08 


Scalar-to-tensor ratio 


br jr 


5.22 


2.67 


Normalization 


bQ 


0.01 


< 0.01 



Table 6.3: Fisher matrix analysis results: expected la errors for the WMAP and Planck 
satellites. See the text for details and discussion. 

Table 6.3 shows the results of our analysis for the expected la error on the physical 
parameters. Determination of the redshift of equality can be achieved by WMAP with 
23% accuracy, while Planck will pinpoint it down to within 2% relative error. From lo tc \ = 
(uJb+u> c )/ z cq it follows that the energy density of relativistic particles, u re i, will be determined 
within 43% by WMAP and 3% by Planck. This translates into an impossibility for WMAP 
alone of measuring the effective number of relativistic species (AN e ff « 3.17 at la), while 
Planck will be able to track it down to AN e Q « 0.24. As for the other parameters, while the 
acoustic peak' positions (through the value of 1Z shlft ) and the matter content of the universe 
can be determined by Planck with high accuracy (of the order of or less than one percent), 
the cosmological constant remains (with CMB data only) almost undetermined, because of 
the effect of the geometrical degeneracy. One could also see this as a consequence of an 
inappropriate parameterization of the problem: we should in fact use the parameters which 
the physics of the CMB measures best, i.e. the principal components. The scalar spectral 
index n s and the overall normalization will be well constrained already by WMAP (within 
15% and 1%, respectively), while because of the reasons explained above the tensor spectral 
index nt and the tensor contribution r will remain largely unconstrained by both experiments. 
Generally, an improvement of a factor ten is to be expected between WMAP and Planck in 
the determination of most cosmological parameters. 

Our analysis considers temperature information only. Inclusion of polarization measure- 
ments would tighten errors, especially for the "primordial" parameters n s ,n t and r (Zal- 
darriaga et al., 1997; Bucher et al., 2001). This is especially important for a WMAP-type 
experiment, since a precise determination of n s and an higher accuracy in oj m would greatly 
improve the precision on N e s which can be obtained with temperature only. By the time 
Planck will obtain his first results, polarization measurements will hopefully have been per- 
formed. Combination of polarization information with the WMAP temperature data would 
then considerably improve the precision of the extracted parameter values. 

A Fisher matrix analysis for AA e fj was previously performed by Lopez et al. (1999) 
and repeated by Kinney & Riotto (1999) with the equivalent chemical potential £, AN = 
15/7(2(£/vr) 2 + (C/vr) 4 ), and a strong de generacy was found between N e R,h and Qa, and to 
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lesser extent with 0&. We have seen here that the degeneracy really is between u ie \,u m and 
n s , and the degeneracy previously observed is thus explained because they considered flat 
models, where a change in Qa is equivalent to a change in u m , uj m = (1 — f^A — Qb)h 2 . The 
results regarding how precisely the future satellite missions can extract the relativistic en- 
ergy density, can be translated into approximately AN c s = 3.17 (£ = 2.4) and AN C ^ = 0.24 
(£ = 0.73) for WMAP and Planck respectively. However, including neutrino oscillation leads 
to equilibration of the different chemical potentials, and hence BBN leads to the stronger 
bound |£| < 0.07 for all neutrino species (Dolgov et al., 2002). 

Comparison with WMAP data analysis 

After the release of the WMAP first year observations, several groups have independently 
carried out an analysis similar to the one presented above (Crotty et al., 2003b; Hannestad, 
2003; Pierpaoli, 2003). Unfortunately, none of these works includes tensor modes as in our 
forecasts, and one has to keep in mind that the FMA assumed temperature information only 
and experimental parameters as appropriate for the original mission specifications, which 
may be slightly different from the effective parameters for the first year only Despite the fact 
that the details of the data included and the prior assumptions vary for each work, the overall 
agreement of their findings with our forecasts for WMAP is nonetheless very satisfactory We 
briefly review their conclusions and compare them with the above predictions. 

In Crotty et al. (2003b) the la error on N eS is found to be AiV efl = 3.4 using WMAP data 
only (but including the TE-spectrum) and a weak top-hat prior on the Hubble parameter, 
0.5 < h < 0.9, with the analysis limited to flat models only. This result has to be contrasted 
with the prediction above, which for the full WMAP data gives (at la) AN c g = 3.17. As 
predicted, the WMAP observations improve dramatically on the bounds for N e g from CMB 
only, which become with the above assumptions —2.1 < AAcfj- < 6.9 (at 2a likelihood 
content). 

These findings are in good concordance with the more general set-up of Pierpaoli (2003), 
where curved models are considered as well, the CBI data are used together with the WMAP 
observations and constraints from the 2dF matter power spectrum are also included. In this 
case the results do not compare directly with our predictions because of the inclusion of 
external constraints in the form of the matter power spectrum. The 95% likelihood interval 
is then tighter because of the more powerful observational data used, giving (without Hubble 
prior) AApfj = 5.5. 

The quite complete investigation of Hannestad (2003) also derives constraints on the 
neutrino masses, and considers the effects of the inclusion of further observational constraints, 
such as a prior on the Hubble parameter, a prior on Q m from supernova? data, a BBN prior 
on u>b and the 2dF matter power spectrum. Where comparable, the findings are entirely 
compatible with the other two works; in particular, for the case of massless neutrinos and 
WMAP data only, the 95% likelihood interval for flat models only and a weak top-hat prior 
0.5 < h < 0.85 is AA" eff = 8.9. 



6.1 Extra relativistic particles 



135 



10-' - 

8x10"" - 

BxIO- 1 f 

4xI0-« - 

M 



+ 

.Si, 

-"4 



2x10-" 
h 
-2xlD-» |- 
-4xlQ-« \- 
-8xtQ 
-8x10"" t 
-1x10-' P- 
1.3xIQ-« f r 
1.4X10-" - 



0.1w t 



eq 



CJ, 




1000 



w 



u 
to 



+ 

.Si, 

">4 




Multipole I 



Multipole I 




10 100 1000 

Multipole t 



Figure 6.5: Derivatives of Cg, with respect to the 9 parameters evaluated at the reference 
model described in the text. The numerical prefactor indicates that the corresponding curve 
has been rescaled: thus O.lu;;, means that the displayed curve is 0.1 • dCi/duJb. The derivative 
dCg/duiA has been set to for i > 200 in order to suppress the effect of numerical errors, 
thus taking into account the geometrical degeneracy. 
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6.2 The primordial helium fraction 

This section is based on the work Trotta & Hansen (2004), where the first determination 
of the helium abundance from CMB data alone was presented. After giving the motivation 
underlying this investigation in § 6.2.1, we discuss in § 6.2.2 the role of the helium mass 
fraction for CMB anisotropies, and in particular the details of the ionization history of the 
Universe which are relevant for constraining the helium abundance with the CMB. We then 
review the standard Big-Bang Nucleosynthesis scenario for the abundance of light elements 
and compare its predictions with current astrophysical measurements in § 6.2.3; the present 
constraints from CMB data are presented in § 6.2.4, while the future potential of using the 
CMB as an independent way of determining the helium abundance is elucidated in § 6.2.5. 
There we also explore the impact of helium for future accurate determination of the baryon 
abundance. 



6.2.1 Motivation 

Our understanding of the baryon abundance has increased dramatically over the last few 
years, coming from two independent paths, namely BBN and CMB. Absorption features 
from high-redshift quasars allow us to measure precisely the deuterium abundance, D/H, 
which combined with BBN calculations provides a reliable estimate of the baryon to photon 
ratio, 

r/io^-10 10 . (6.14) 

n 7 

An independent determination of the baryon content of the universe from CMB anisotropies 
comes from the increasingly precise measurements of the acoustic peaks, via the character- 
istic signature of the photon-baryon fluid oscillations discussed in § 4.1.2.2. The agreement 
between these two completely different approaches is both remarkable and impressive (see 
details below). The time is therefore ripe to proceed and test the agreement between other 
light elements which are also probed both by BBN and CMB. 

Helium being the most abundant of the light elements, it is natural to focus on this element 
by exploring the dependency of CMB anisotropies on the value of the primordial helium mass 
fraction Y p , defined as 

Y p = ^, (6.15) 
n b 

where rtHe and denote the number densities of 4 He atoms and baryons, respectively. If we 
denote by and np the number densities of neutrons and protons, respectively, and assume 
that all neutrons are in He nuclei, then a simple counting argument gives the estimate 

2n N /n P ^ 
1 + n N /n P 

where the numerical value comes from a rough approximation to the freeze-out value of the 
neutron to proton ratio n^/rip « 1/7, see e.g. Kolb & Turner (1990). The detailed value of 
Y p is predicted by BBN as a function of two parameters only, the baryon abundance and the 
number of relativistic degrees of freedom at BBN (Fields k, Sarkar, 2004). 

The hope is that the CMB observations might provide an independent measurement of 
Y p , accurate enough to help clarify the present-day discrepancies between direct observations 
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of the helium fraction derived from astrophysical systems, whose errors are seemingly domi- 
nated by systematics which are hard to assess. The latest CMB data are precise enough to 
allow taking this further step, and in view of the emerging "baryon tension" between BBN 
predictions from observations of different light elements (Cyburt et al., 2003) possibly requires 
taking such a step. The advantage of using CMB anisotropics rather than the traditional 
astrophysical measurements, is that the CMB provides a clear measurement of the primordial 
helium fraction before it could be changed by any astrophysical process. On the other hand 
the dependence of the CMB power spectrum on the primordial helium fraction is rather mild, 
a fact which makes it presently safe to set the helium mass fraction to a constant for the 
purpose of CMB data analysis of other cosmological parameters, but will have an impact on 
the baryon abundance determination from Planck quality data, as we show in § 6.2.5. 

6.2.2 The impact of helium on the CMB: ionization history revisited 

We now resume our discussion of the recombination epoch and reionization history of the 
Universe sketched in § 4.1.3, and focus on the role of the helium mass fraction, considered here 
as a free parameter. In a second step, the aim will be to combine the CMB results with the 
BBN predictions and compare the result with the independent astrophysical determinations 
of the light elements abundance. We thus have at our disposal three different tools, each of 
which probes the same quantities at three vastly different epochs of the cosmic history. It 
is important to stress that a good agreement among the three is by no means trivial, and 
that testing their concordance is a powerful way to check the consistency of the standard 
cosmological scenario. On the other hand, significant discrepancies would necessary imply 
the need for new physics. 

The recent WMAP data allow us to determine with very high precision the epoch of pho- 
ton decoupling, z^ ec , i.e. the epoch at which the ionized electron fraction, x e (z) = n e /riH, 
has dropped from 1 to its residual value of order 10 . Here n e denotes the number density 
of free electrons, while nu is the total number density of H atoms (both ionized and recom- 
bined). The redshift of decoupling has been determined to be Zd ec = 1088^<j> (Spergel et al., 
2003), which corresponds to a temperature of about 0.25 eV. Helium recombines earlier than 
hydrogen, roughly in two steps: around redshift z = 6000 Helll recombines to Hell, while 
Hell to Hel recombination begins around z < 2500 and finishes just after the start of H 
recombination (Libarskii & Sunyaev, 1983; Hu et al., 1995; Seager et al., 1999, 2000). 

The baryon number density per m 3 n b {z) is related to the baryon energy density today, 
Ub, by 

n b = 11.3(1 + zfuj b (6.17) 

and we have nn = ^(l — Y p ). Usually, the ionization history is described in terms of 
x e (z) = n e /(nb(l — Y p )). However, for the purpose of discussing the role of Y p , it is more 
convenient to consider the quantity 

f e (z) = n e /n b (6.18) 

instead, the ratio of free electrons to the total number of baryons. For brevity, we will call 
f e the free electron fraction. Once the baryon number density has been set by fixing uj b . 
one can think of Y p as an additional parameter which controls the number of free electrons 
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Figure 6.6: Evolution of the number density of electrons normalized to the number density 
of baryons, f e = n e /rii, 7 as a function of redshift for different values of the helium fraction 
Y p . The black-solid curve corresponds to the standard value Y p = 0.24. The labels (a) to (d) 
indicate the four different phases discussed in the text. 

available in the tight coupling regime. The CMB power spectrum depends on the full detailed 
evolution of the free electron fraction, but we can qualitatively describe the role of helium in 
four different phases of the ionization/recombination history, displayed in Fig. 6.6. 

(a) Before Helll recombination, all electrons are free, therefore f e (z > 6000) = 1 — Y p /2. 

(b) Hell progressively recombines and just before H recombination begins, f e has dropped 



to the value f e (z » 1100) =1-Y p . 

(c) After decoupling, a residual fraction of free electrons freezes out, giving / e (30 < z < 800) = 
/f s « 2.7 • 10" 5 ^M>- 




(d) Reionization of all the H atoms gives f e (z < 20) = 1 — Y p . 

During phase (a), the photon-baryons fluid is in the tight coupling regime. However 
the presence of ionized He increases diffusion damping, therefore having an impact on the 
damping scale in the acoustic peaks region: the diffusion damping length (3.120, page 66) 
including helium can be approximated as (Hu & Sugiyama, 1995a) 



As expected, a larger helium fraction implies an increased damping length, and thus an extra 
power suppression on small scales. 





(6.19) 
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When the detailed energy level structure of Hell is taken into account (Seager et a!., 2000), 
the transition to phase (b) is smoother than in the Saha equation approximation. Therefore 
the plateau with f e = 1 — Y p is not visible in Fig. 6.6. Before H recombination, He atoms 
remain tightly coupled to H atoms through collisions, with the same dynamical behavior. In 
particular, it is the total u>b which determines the amount of gravitational pressure on the 
photon-baryons fluid, and which sets the acoustic peak enhancement/suppression, see § 4.1.2. 
Hence we do not expect the value of Y p to have any influence on the boosting (suppression) 
of odd (even) peaks. The redshift of decoupling (transition between (b) and (c)) depends 
mildly on Y p in a correlated way with ojb, since the number density of free electrons in the 
tight coupling regime (just before H recombination) scales as n e = f e rib = n&(l — Y p ). Hence 
an increase in uif, can be compensated by allowing for a larger helium fraction. An analytical 
estimate along the same lines as in e.g. Kolb & Turner (1990) indicates that a 10% change in 
Y p affects z^ ec by roughly 0.1%, which corresponds to Az^ec ~ 1- This is of the same order 
as the current la errors on z^ ec , obtained by fixing Y p = 0.24. 

After H recombination, the residual ionized electron fraction /g es does not depend on Y p , 
but is inversely proportional to the total baryon density (phase (c)). As the CMB photons 
propagate, they are occasionally rescattered by the residual free electrons. The corresponding 
optical depth, r res is given by 



Performing the integral we can safely neglect the contribution of the cosmological constant 
at small redshift, since z& ec S> VLj^/VL m . Retaining only the leading term, the approximated 
optical depth from the residual ionization fraction is estimated to be 



independent of the cosmological parameters and of the helium fraction. Therefore after last 
scattering we do not expect any significant effect on CMB anisotropics coming from the 
primordial helium fraction, until the reionization epoch, phase (d). 

As pointed out in § 4.1.3.2, CMB anisotropies are sensitive only to the integrated reionized 
fraction if temperature information only is available, while specific signatures are imprinted 
on the E-polarization and ET-cross correlation power spectra by the detailed shape of the 
reionization history. There are several physically motivated reionization scenarios, which how- 
ever cannot be clearly distinguished at present (Haiman & Holder, 2003; Hansen & Haiman, 
2004). Therefore at the present level of accuracy it is safe for our purpose to assume an abrupt 
reionization, i.e. that at the reionization redshift z re all the hydrogen was quickly reionized, 
thus producing a sharp rise of n e from its residual value to njj. More precisely, z Te is the 
redshift at which x e (z re ) = 0.5. In our treatment we neglect Hell reionization, for which there 
is evidence at a redshift z ~ 3 (see Theuns et al., 2002 and references therein). This effect is 
small, since one extra electron released at z « 3 would change the reionization optical depth 
only by about 1%. The effect of Helll reionization, which happens still later, is even smaller. 
We also neglect the increase of the helium fraction due to non-primordial helium production, 




(6.20) 
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1.24 • 10" 6 (1 + z dec ) 3/2 ~ 0.045 



(6.21) 
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Figure 6.7: CMB temperature (left panel) and polarization (right panel) power spectra and 
percentage change (bottom panels) for a 10% larger (smaller) value of the helium mass 
fraction, Y p . The solid-black line in the top panels corresponds to a standard ACDM model, 
with Y p = 0.24. The impact is at the percent level, and is almost indistinguishable in the 
top panels. All other parameters are fixed to the value of our fiducial model (Table 6.4), in 
particular, we have T re = 0.166. 



which has a negligible effect on CMB anisotropies. Those approximations do not affect the 
results at today's level of sensitivity of CMB data: for WMAP noise levels, even inclusion of 
the polarization spectra is not enough to distinguish between a sudden reionization scenario 
and a more complex reionization history. At the level of Planck a more refined modelling of 
the reionization mechanism will be necessary (Holder et al., 2003; Doroshkevich et al., 2003). 

In the sudden reionization scenario adopted here, the relation between reionization redshift 
and reionization optical depth, r re , is given by Eq. (4.38, page 87). Once again, since the 
number density of reionized electrons scales as u>b(l — Y p ), the redshift of reionization is 
positively correlated with Y p (for fixed optical depth and baryon density). 

As a result of the physical mechanism described above, a 10% change in Y p has a net impact 
on the CMB power spectrum at the percent level. The impact on the CMB temperature and 
polarization power spectra is highlighted in Fig. 6.7. In the temperature panel, we notice 
that a larger helium fraction slightly suppresses the peaks because of diffusion damping, 
while it has no impact on large scales. Polarization is induced by the temperature quadrupole 
component at last scattering and the reionization bump induced in the polarization spectrum 
(see § 4.1.3.2) is clearly visible in the polarization panel of Fig. 6.7 in the I « 15 region. A 
change in the helium fraction implies a shift of the redshift of reionization for a given (fixed) 
optical depth, and a consequent shift of the position of the reionization bump via Eq. (4.41, 
page 87). The value of Y p does not affect the height of the bump, which is controlled by the 
optical depth and is proportional to r 2 . This effect is highlighted in the polarization panel of 
Fig. 6.7: a 10% change in Y p induces roughly a 10% change in the position of the bump. The 



6.2 The primordial helium fraction 



141 



subsequent two oscillatory features for i < 50 reflect the displacement of further secondary 
reionization induced polarization oscillations. However, since the value of polarized power is 
very low in that region, such secondary oscillations are very hard to detect precisely. 

In principle, given an accurate knowledge of the reionization history, the effect of Y p on 
the polarization bump would assist in the determination of the helium abundance. However, 
our ignorance of the reionization history prevents us from recovering useful information out 
of the measured reionization bump. The displacement induced by Y p is in fact degenerate 
with a partial reionization, or with other, more complex reionization mechanisms. Hence 
constraints on Y p come effectively from the damping tail in the £ > 400 region of the temper- 
ature spectrum, which needs to be measured with very high accuracy. Other light elements 
like deuterium and helium-3 are much less abundant, and will therefore have even smaller 
effect on the CMB power spectrum, at the order of 10~ 5 . 

6.2.3 Astrophysical measurements and BBN predictions 

Once we fix the number of relativistic degrees of freedom by specifying the number of massless 
neutrino families, the standard model of Big-Bang Nucleosynthesis (BBN) has only one free 
parameter, namely the baryon to photon ratio 7710 defined in (1.47, page 19), which for long 
has been known to be in the range 1 — 10 (Kolb & Turner, 1990). Thus by observing just 
one primordial light element one can predict the abundances of all the other light elements. 

Astrophysical measurements 

The deuterium to hydrogen abundance, D/H, is observed by Ly-a features in several quasar 
absorption systems at high redshift, D/H = 2.78~^q'^ x 10~ 5 (Kirkman et al., 2003), which 
in BBN translates into the baryon abundance, r/io = 5.9 ±0.5. Using BBN one thus predicts 
the helium mass fraction to be in the range 0.2470 <Y P < 0.2487. The dispersion in various 
deuterium observations is, however, still rather large, ranging from D/H = 1.65 ± 0.35 x 
10^ 5 (Pettini & Bowen, 2001) to D/H = 3.98±^|| x 10~ 5 (Kirkman et al., 2003), which most 
probably indicates underestimated systematic errors. 

The observed helium mass fraction comes from the study of extragalactic HII regions 
in blue compact galaxies. A careful study by Izotov & Thuan (1998) gives the value Yp = 
0.244±0.002; however, also here there is a large scatter in the various observed values, ranging 
from Y p = 0.230 ±0.003 (Olive et al., 1997) over Y p = 0.2384 ±0.0025 (Peimbert et al., 2002) 
and Y p = 0.2391 ±0.0020 (Luridiana et al., 2003) to Y p = 0.2452 ±0.0015 (Izotov et al., 1999). 
Besides the large scatter there is also the problem that the helium mass fraction predicted 
from observations of deuterium combined with BBN, 0.2470 < Y p < 0.2487, is larger than 
(and seems almost in disagreement with) most of the observed helium abundances, which 
probably points towards underestimated systematic errors, rather than the need for new 
physics (Cyburt et al., 2003; Barger et al., 2003b). Figure 6.8 is a compilation of the above 
measurements, and offers a direct comparison with the current (large) errors from CMB 
observations, presented in § 6.2.4, and with the potential of future CMB measurements, 
discussed in § 6.2.5. 

The observed abundance of primordial 7 Li using the Spite plateau is possibly spoiled by 
various systematic effects (Ryan et al., 2000; Salaris h Weiss, 2001). Therefore it is more 
appropriate to use the BBN predictions together with observations to estimate the depletion 
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Figure 6.8: In the top panel we plot a few current direct astrophysical measurements of the 
helium mass fraction Y p as Gaussian likelihood curves with standard deviation corresponding 
to the given la (statistical) error (blue/dark gray curves, on the left of the diagram), and the 
value inferred from deuterium measurements combined with BBN (yellow/light gray curve, 
on the far right), see the text for references. In the bottom panel, a direct comparison with 
CMB present-day accuracy (actual CMB data, black dashed line, this work; the la likelihood 
interval is 0.16 < Y p < 0.50) and with its future potential (Fisher matrix forecast for Planck 
- green/light gray curve - and a Cosmic Variance Limited experiment - orange/dark gray 
curve) . 

factor fj = 7 Li bs/ 7 Li pr i m instead of using Li b s to infer the value of 7710 (Buries et al., 2001; 
Hansen et al., 2002). 

The numerical predictions of standard BBN (as well as various non-standard scenarios) 
have reached a high level of accuracy (Lopez & Turner, 1999; Esposito et al., 2000a,b; Buries 
et al., 2001), and the precision of these codes is well beyond the systematic errors discussed 
above. 

BBN and the need for new physics 

If the CMB-determined helium mass fraction turns out to be as high as suggested by BBN 
calculations together with the CMB observation of l^/i 2 (as discussed above), this could 
indicate a systematic error in the present direct astrophysical helium observations. 

Alternatively, if the CMB could independently determine the helium value with sufficient 
precision to confirm the present helium observations, then this would be a smoking gun for 
new physics. In fact, one could easily imagine non-standard BBN scenarios which would 
agree with present observations of 7710, while having a low helium mass fraction. All what 
is needed is additional non-equilibrium electron neutrinos produced at the time of neutrino 
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decoupling which would alter the n — p reaction. This could alter the resulting helium mass 
fraction while leaving the deuterium abundance unchanged. One such possibility would be a 
heavy sterile neutrino whose decay products include v e . A sterile neutrino with life-time of 
1 — 5 sec and with decay channel v s — ► v e + <f> with <p a light scalar (like a majoron), would 
leave the deuterium abundance roughly untouched, but can change the helium mass fraction 
between AY p = —0.025 and AY p = 0.015 if the sterile neutrino mass is in the range 1 — 20 
MeV (Dolgov et al., 1999). A simpler model would be standard neutrino oscillation between 
a sterile neutrino and the electron neutrino. The lifetime is about 1 sec when the sterile state 
has mass about 10 MeV, and the decay channel is v s — ► v e + I + / (with I any light lepton), 
and such masses and life-times are still unconstrained for large mixing angle (Dolgov et al., 
2000). Related BBN issues are discussed by Shi et al. (1999); Di Bari k Foot (2001); Kirilova 
(2003). Such possibilities are hard to constrain without an independent measurement of the 
helium mass fraction. 

Another much studied effect of neutrinos is the increased expansion rate of the universe if 
additional degrees of freedom are present (for BBN), and the degeneracy between the total 
density in matter and relativistic particles (for CMB), which is presented in detail in § 6.1. 
The more general set-up would then be to allow N e g as a further free parameter both in the 
CMB and BBN analysis, but because of the very weak dependence of the CMB on Y p this 
would spoil any hope of being able to constrain the helium fraction with the CMB; therefore 
we choose to fix N e g = 3.04. 

Also, an electron neutrino chemical potential could potentially alter the BBN predic- 
tions (Kang & Steigman, 1992; Lesgourgues & Pastor, 1999), however, with the observed 
neutrino oscillation parameters the different neutrino chemical potentials would equilibrate 
before the onset of BBN (Dolgov et al., 2002; Wong, 2002; Abazajian et al., 2002), hence 
virtually excluding this possibility (see however Barger et al., 2003a). 

6.2.4 WMAP Monte Carlo analysis 

We use a modified version of the publicly available Markov Chain Monte Carlo package 
COSMOMC as described in Lewis & Bridle (2002) in order to construct Markov chains (see 
§ 5.1.7) in our seven dimensional parameter space. We sample over the following set of 
cosmological parameters: the physical baryon and CDM densities, LOb = f^/i 2 and ui c = O c /i 2 , 
the cosmological constant in units of the critical density, Qa, the scalar spectral index and 
the overall normalization of the adiabatic power spectrum, n s and A s = £q, cf. Eq. (4.6, page 
79), the redshift at which the reionization fraction is a half, z re , and the primordial helium 
mass fraction, Y p . We restrict our analysis to flat models, therefore the Hubble parameter is 
a derived parameter, 

h= [K + w fe )/(l-0 A )] 1/2 . (6.22) 

We consider purely adiabatic initial conditions and three massless neutrino families for the 
reason given above. We do not consider either gravitational waves or massive neutrinos. We 
include the WMAP data from Kogut et al. (2003); Hinshaw et al. (2003b) (temperature and 
polarization) with the routine for computing the likelihood supplied by the WMAP team 
(Verde et al., 2003). We make use of the CBI (Pearson et al., 2003) and of the decorrelated 
ACBAR (Kuo et al., 2004) band powers above i = 800 to cover the small angular scale region 
of the power spectrum. 
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Figure 6.9: One-dimensional posterior likelihood distribution for the helium mass fraction, 
Y p , using CMB data only. The solid-black line is for all other parameters marginalized, the 
dashed-red line gives the mean likelihood. 

Since Y p is a rather flat direction in parameter space with present-day data, we find that 
a much larger number of samples is needed in order to achieve good mixing and convergence 
of the chains in the full 7D space. We use M = 4 chains, each containing approximately 
N = 3 • 10 5 samples. The mixing diagnostic is carried out along the same lines as in Verde 
et al. (2003), by means of the Gelman and Rubin criterion (Gelman & Rubin, 1992b). The 
burn-in of the chains also takes longer than in the case where Y p is held fixed, and we discard 
6000 samples per chain. 

Results 

Marginalizing over all other parameters, we find that the helium mass fraction from CMB 
alone is constrained to be 

Y p < 0.647 at 99% I.e. (1 tail limit) (6.23) 
and 0.160 <Y P < 0.501 at 68% I.e. (2 tails). (6.24) 

Thus, for the first time the primordial helium mass fraction has been observed using the 
cosmic microwave background. However, present-day CMB data do not have by far sufficient 
resolution to discriminate between the astrophysical helium measurements, Y p ~ 0.244, and 
the deuterium guided BBN predictions, Y p ~ 0.248, which would require percent precision. 

In Fig. 6.9 we plot the marginalized and the mean likelihood of the Monte Carlo samples 
as a function of Y p . If the likelihood distribution is Gaussian, then the 2 curves should be 
indistinguishable. The difference between marginalized and mean likelihood for Y p indicates 
that the marginalized parameters are skewing the distribution, and therefore that correlations 
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Figure 6.10: Joint 68% and 99% likelihood contours in the (wj,, Yp)-plane from CMB data 
alone. The solid-blue line gives the BBN prediction (Buries et al., 1999), which on this figure 
almost looks like a straight line. 

play an important role. Although the mean of the ID marginalized likelihood is rather 
high, (L(Y P )) = 0.33, the mean likelihood peaks in the region indicated by astrophysical 
measurements, Y p ~ 0.25. In view of this difference, it is important to understand the role 
of correlations with other parameters, and we will turn to this issue now. 

In Fig. 6.10 we plot joint 68% and 99% confidence contours in the (u>b, Yp)-space. From 
the Monte Carlo samples we obtain a small and negative correlation coefficient between the 
two parameters, corr(YJ,, LOb) = —0.14. Baryons and helium appear to be anticorrelated sim- 
ply because present-day WMAP data do not map the peaks structure to sufficiently high t. 
Precise measurements in the small angular scale region should reveal the expected positive 
correlation between the baryon and helium abundances, which is potentially important in or- 
der to correctly combine BBN predictions and CMB measurements of the baryon abundance. 
We turn to this question in more detail in the next section. In BBN the baryon fraction 
and helium fraction are correlated along a different direction, cf. Fig. 6.10. However, this 
correlation is very weak, and the BBN relation gives practically a flat line. Since the two 
parameters are not independent from the CMB point of view, it is in fact not completely 
accurate to perform the CMB analysis with fixed helium mass fraction of Y p = 0.24 to get the 
error-bars on the baryon fraction, and then re-input this baryon fraction (and error-bars) to 
predict the helium mass fraction from BBN. The most accurate procedure is to analyse the 
CMB data leaving Y p as a free parameter, thereby obtaining the correct (potentially larger) 
error-bars on u>b upon marginalization over Y p . 

In view of the emerging baryon tension between CMB and BBN, it is important to check 
whether allowing helium as a free parameter can significantly change the CMB determination 
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of the baryon density or its error. In order to evaluate in detail the impact of Y p on the error- 
bars for uJb, we consider the following three cases. 

(a) The usual case, when the helium fraction for the CMB analysis is assumed to be known 
a priori and is fixed to the canonical value Y p = 0.24. 

(b) A case with a weak astrophysical Gaussian prior on the helium fraction, which we 
take to be Y p = 0.24 ± 0.01. As discussed above, the error-bars of the astrophysical 
measurements are typically a factor 5 tighter than this, but our prior is chosen to 
encompass the systematic spread between the different observations. 

(c) The case in which we assume a uniform prior for Y p in the range < Y p < 1, i.e. Y p is 
considered as a totally free parameter. 

We do not find any significant change in the error-bars for LOb in the three different cases. 
The confidence intervals on ojb alone are determined to be (case (c)) 0.0221 < uib < 0.0245 at 
68% I.e. (0.0204 < Lo b < 0.0276 at 99 % I.e.). The standard deviation of io h as estimated from 
the Monte Carlo samples is found to be &b = 1.3 • 10~ 3 . This is in complete agreement with 
the error-bars on LOb obtained by the WMAP team for the standard ACDM case (Spergel 
et al., 2003). We conclude that at the level of precision of present-day CMB data, it is still 
safe to treat the baryon abundance and the helium mass fraction as independent parameters. 
This result is non-trivial, since the fact that the damping tail is not yet precisely measured 
above the second peak would a priori suggest that degeneracies between Y p , u>b and n s could 
potentially play a role once the assumption of zero uncertainty on Y p is relaxed. The impact 
of Y p is small enough, and the error-bars on u>b large enough that a uniform prior on Y p can 
still be accommodated within the uncertainty in the baryon abundance obtained for case (a). 
However, the Y p — u>b correlation will have to be taken into account to correctly analyze future 
CMB data, with a quality such as Planck. We discuss this potential in the next section. 

We observe the expected correlation between the redshift of reionization and the helium 
fraction (Fig. 6.11), which is discussed above. The correlation coefficient between the two 
parameters is found to be rather large and positive, corr(Yp, z TC ) = 0.40. This correlation 
produces a noticeable change in the marginalized ID-likelihood distribution for z re as we go 
from case (a) to case (c). Marginalization over the additional degree of freedom given by 
Y p broadens considerably the error-bars on z TC . In fact, the 68% confidence interval for z TC 
increases by roughly 20% (and shifts to somewhat higher values), from 10.2 — 20.9 (case (a)) 
to 10.6 — 23.3 (case (c)). Case (b) exhibits similar error-bars as case (a). On the other hand, 
the determination of the reionization optical depth is not affected by the inclusion of helium 
as a free parameter, giving in all cases 0.08 < r re < 0.23. Correspondingly, the correlation 
is less significant, corr(l^, r re ) = —0.11. We therefore conclude that the differences in the 
determination of z ve are due only to the variation of the amount of electrons available for 
reionization as Y p is changed. 

Leaving 7 p as a free parameter also has an impact on the relation between LOb and the 
scalar spectral index, n s . The extra power suppression on small scales which is produced by 
a larger Y p can be compensated by a blue spectral index, cf. Fig. 6.12. 
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Figure 6.11: Joint 68% and 99% likelihood contours in the (Y p , z re )-plane (upper panel) 
and in the corresponding (Y p , r re )-plane (bottom panel) from CMB data alone. In the upper 
panel, the solid-red line is the relation z rc (Y p ) from Eq. (4.38, page 87), obtained by fixing the 
reionization optical depth to the value r re = 0.166, while the other parameters are those of our 
fiducial ACDM model. Although clearly the exact shape of z re (Y p ) depends on the particular 
choice of cosmology, it is apparent that the Y p — z m degeneracy is along this direction. The 
correlation between Y p — r re is almost negligible with present-day data (bottom panel). 

6.2.5 Potential of future CMB observations 

In order to estimate the precision with which future satellite CMB measurements will be 
able to constrain the helium mass fraction we perform a Fisher matrix analysis along the 
lines presented in § 5.2. As already emphasized, in order to obtain a reliable prediction, it is 
extremely important to choose a parameter set whose effect on the CMB power spectrum is 
as linear and uncorrelated as possible. Here we improve upon the choice made in § 6.1.5 by 
adopting the full set of normal parameters introduced in § 4.2. Our nine dimensional basis 
parameter set is then 

6 = {A,B,V,TZ,M,T,A s ,n s ,Y p } , (6.25) 

where the scalar power spectrum normalization constant is A s = £q, see (4.6, page 79). The 
quantities A,B,V,TZ,Ai,T are defined in Eqs. (4.43-4.47, page 90). It has been shown that 
the normal parameter set is very well adapted to the FMA, which give accurate predictions 
(Kosowsky et al., 2002). Since here we are interested in the predictions for B = fl^h 2 and Y p , 
we do not need to explicitly map the FMA forecasts in the normal parameter space onto the 
cosmological parameter space. 

The choice of the physical parameter set makes it easy to implement in the FMA interest- 
ing theoretical priors. For instance, we are interested in imposing flatness in our forecast, in 
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Figure 6.12: Scatter plot in the LOb — n s plane, with the value of Y p rendered following the 
color scale. Green corresponds roughly to the BBN preferred value. 

order to be able to directly compare present-day accuracy on Y p with the potential of Planck 
and of and ideal CMB experiment (see below). The prior on the curvature of the universe 
is imposed in the FMA by fixing the value of the parameter A to the one of the fiducial 
model. In fact, the parameter A is a generalization of the shift parameter, which describes 
the sideways shift of the acoustic peak structure of the CMB power spectrum as a function of 
the geometry of the universe and its content in matter, radiation and cosmological constant. 
Although imposing A = const is not the same as having a constant spatial curvature over 
the full range of cosmological parameters, for the purpose of evaluating derivatives the two 
conditions reduce to the same. The fact that our fiducial model is actually slightly open 
(see below), does not make any substantial difference in the results, apart from reducing the 
numerical inaccuracies which would arise had we computed the derivatives around an exactly 
flat model. We can also easily impose a prior knowledge of the helium fraction, by fixing 
the value of Y p , as is usually the case for present CMB analyses, and investigate how this 
modifies the expected error on the the baryon density. 

Accuracy issues 

We numerically compute double sided derivative of the power spectrum around the fiducial 
model with cosmological parameters given in Table 6.4. We find it necessary to increase 
the accuracy of CAMB by a factor of 3 in each of the "accuracy boost" values. As a fiducial 
model, we use the best fit model to the WMAP data for the standard ACDM scenario, as 
given in Table 1 of Spergel et al. (2003). However, in order to avoid numerical inaccuracies 
which arise when differentiating around a flat model, we reduce slightly the value of Oa by 
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Parameter 



Value 



Baryons 
Matter 



0.046 
0.270 
0.720 



<m 



Massless v families 
Total density 
Hubble constant 



Dark Energy 
Radiation 




7.95 • 10~ 5 



3.04 
0.990 
0.72 



Optical depth 
Spectral index 
Normalization 



A s 



re 



0.99 
2 • 10~' 



0.166 



Table 6.4: Cosmological parameters for the fiducial ACDM model around which the FMA is 
performed. We choose a slightly open model to avoid numerical inaccuracies in the derivatives. 

imposing an open universe, il^ — 0.99. 

We perform the FMA for the expected capabilities of Planck's High Frequency Instru- 
ment (HFI) and for an ideal CMB measurement which would be cosmic variance limited 
(CVL) both in temperature and in E-polarization (and we do not consider the B-polarization 
spectrum), and therefore represents the best possible parameter measurement from CMB 
anisotropies alone. The complicated issues coming from foreground removals, point source 
subtraction, etc. are assumed to be already (roughly) taken into account by the experi- 
mental parameters, see § 5.2.1 for definitions. These are the effective percentage sky cov- 

T E 

erage / s k y , the number of channels, the sensitivity of each channel a c ' for temperature 

T E 

(T) and E-polarization (E) in fiK and the angular resolution 9 C ' (in arcmin). For Planck 
HFI, we take the three channels with frequencies 100, 143 and 217 GHz, with respectively 
crf =12i3 = 5.4,6.0,13.1 and cr^ 2 3 = 11.4,26.7 and we have / sky = 0.85 (Planck Website, 
2004) Since the CVL is an ideal experiment, we put its noise to zero and assume perfect 
foregrounds removal, so that / s k y = 1. In order to test the accuracy of our predictions 
and compare present-day results with the forecasts, we also perform an FMA with WMAP 
first year parameters, obtaining excellent agreement between the FMA results and the error- 
bars from actual data. For the purpose of comparison, we include forecasts for the full 
WMAP four year mission, which will also measure E-polarization and reduce present-day 
errors on the temperature spectrum by a factor of two. We limit the range of multipoles 
to i < 2000, because at smaller angular scales non-primary anisotropies begin to dominate 
(Sunyaev-Zeldovich effect). Seljak et al. (2003b) discuss the issue of numerical precision of 
three different CMB codes and conclude that they are accurate to within 0.1%. While this 
is encouraging, it is not of direct relevance to this work, since what matters in the computa- 
tion of derivatives is not much the absolute precision of the spectra, but rather their relative 
accuracy. 

Forecasts and discussion 



Table 6.5 summarizes our forecasts for the future measurements and compares them with the 
results obtained from WMAP actual data. 
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Flatness 


Flatness and 












Y p = 0.24 






At^ 










y P 


u> b 




u> b 




WMAP 4yrs 1 


~ 50 


2.92 


~ 40 


2.86 


2.86 


Planck 


7.60 


1.31 


4.96 


1.26 


0.70 


CVL 


2.59 


0.34 


1.52 


0.32 


0.13 


Temperature + TE-cross 


WMAP 1st yr 2 


N/A 


N/A 


71.25 


5.04 


5.04 


WMAP 4yrs 1 


~ 75 


4.10 


~ 60 


3.94 


3.94 


Planck 


8.91 


1.74 


6.60 


1.63 


0.74 


CVL 


5.18 


0.55 


2.84 


0.55 


0.19 



Table 6.5: Fisher matrix forecasts and comparison with present-day results for different priors 
and using different combinations of temperature and polarization CMB spectra. Errors are 
in percent with respect to the values of the fiducial model, Y p = 0.24 and = 0.0238 (la 
I.e. all other marginalized). 

We notice that when the WMAP full four year data will be available (including E- 
polarization), the error on the baryon density is expected to decrease by a factor of two 
to about 3%, compared to today's 5% (assuming flatness). Nevertheless, inclusion of Y p as 
a free parameter will still have no effect on the determination of cob for WMAP, i.e. Y p will 
remain an essentially flat direction when marginalized over. While the determination of the 
helium fraction will improve, the FMA cannot reliably assess quantitatively how much, since 
for such large errors the likelihood distribution is not Gaussian and the quadratic approxi- 
mation breaks down. In the table we therefore give the FMA estimation as an indication, 
with the caveat that the Fisher approximation is likely to be inaccurate for the real errors 
on Y p from WMAP's four year data-set. 

It is interesting that for Planck, the effect of the helium fraction can no longer be neglected. 
Inclusion of the helium fraction increases the error on a>& by roughly 80%, from 0.7% to 1.3%. 
The correlation between the two parameters will have to be taken into account, as is evident 
from Fig. 6.13. The expected correlation coefficient is corr(Y p , uJb) = 0.84 (0.91) for Planck 
(for CVL). The expected la error on Y p is about 5% for Planck, or AY P ~ 0.01. This is of the 
same order as the spread in current astrophysical measurements. We conclude that in Planck- 
accuracy data analysis, it will be necessary to include the uncertainty in the determination 
of the helium mass fraction, at least in the form of a Gaussian prior over Y p of the type we 
used in the CMB data analysis presented above. 

Finally, measuring CMB temperature and polarization with cosmic variance accuracy 
would allow Y p to be constrained to within 1.5%, or AY p ~ 0.0036 (assuming flatness). Such 
an ideal measurement would be able to discriminate between the BBN-guided, deuterium 
based helium value and the current lowest, direct helium observations (cf. Fig. 6.8). 

Our forecasts for the uncertainty in the Helium mass fraction from future observations are 
in excellent agreement with the findings of Kaplinghat et al. (2003b). There, the standard 
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Figure 6.13: FMA forecast for the expected errors from WMAP four year mission (dotted- 
black), Planck (dashed-red) and a CVL experiment (solid-green). The ellipses encompass 
la" and 3a joint likelihood regions for Uf, — Y p (all other parameters marginalized). The axis 
values give the error in with respect to the fiducial model values. This forecast is for the full 
CMB information (Temperature, TE-cross, E-polarization) and assumes flatness. 

deviation on Y p for Planck is estimated to be AY p = 0.012. Kaplinghat et al. (2003b) 
also consider an experiment (CMBPol) with characteristics similar to our CVL, for which 
they forecast AY P = 0.0039, again in close agreement with our result. In an earlier work, 
Eisenstein et al. (1998a) found for Planck (temperature and polarization) AY P = 0.013, also 
in satisfactory concordance with our result. It should be noticed that the forecast reported 
for MAP in Table 2 of Eisenstein et al. (1998a), namely AY p = 0.02, is nothing but the 
Gaussian prior Y p = 0.24 ± 0.02 which was assumed in their analysis. 

The main source of improvement for the determination of Y p will be the better sampling of 
the temperature damping tail provided by Planck and the CVL. Polarization measurements 
have mainly the effect of reducing the errors on other parameters. In fact, we have checked 
that excluding from our FMA the 2 < I < 50 region of the E-polarization and ET-correlation 
spectra changes the forecast precision on Y p less than about 10-15% for Planck and less than 
a few percent for CVL. This supports the conclusion that the low-^ reionization bump is not 
very useful in measuring the helium abundance, because of the degeneracy with z re . 
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6.3 Time variations of the fine- structure constant 

The search for observational evidence for time or space variations of the 'fundamental' con- 
stants that can be measured in our four- dimensional world is an extremely exciting area of 
current research, with several independent claims of detections in different contexts emerging 
in the past few years. In particular, possible time variations of the fine-structure constant 
can be tested with the CMB, and represent another line of investigation going beyond the 
standard description of cosmology. The contents of this section summarize the latest result of 
a rather large collaboration I have been involved with, aimed at constraining time variations 
of the fine-structure constant using CMB anisotropy. We thoroughly studied the issue of 
crucial degeneracies with other cosmological parameters and discussing what improvements 
can be expected with forthcoming data-sets (Martins et al., 2002, 2004; Rocha et al., 2004). 

We motivate the search for time variations of the fine-structure constant in § 6.3.1, and 
review the current observational status of observations other than the CMB in § 6.3.2. After 
presenting the relevance of the fine-structure constant for CMB anisotropies in § 6.3.3 and 
§ 6.3.4, in § 6.3.5 we provide up-to-date WMAP constraints on the value of a at the epoch of 
decoupling; § 6.3.6 is dedicated to a detailed Fisher matrix analysis which encompasses the 
standard parameters plus the fine-structure constant for the full WMAP four year data, for 
the Planck satellite and for a cosmic variance limited, ideal experiment. 

6.3.1 Motivation 

Cosmology and astrophysics play an increasingly important role as testing ground for our 
understanding of fundamental physics, since they provide us with extreme conditions (that 
one has no hope of reproducing in terrestrial laboratories) in which to carry out a plethora of 
tests and search for new paradigms. Perhaps the more illuminating example is that of mul- 
tidimensional cosmology: currently preferred unification theories (Polchinski, 1998; Damour, 
2003a) predict the existence of additional space-time dimensions, which will have a num- 
ber of possibly observable consequences, including modifications in the gravitational laws on 
very large (or very small) scales (Will, 2001) and space-time variations of the fundamental 
constants of nature (Martins, 2002; Uzan, 2003). 

The most promising case, and the one that has been the subject of most recent work and 
speculation, is that of the fine-structure constant 

e 2 

a = — (6.26) 

he 

where e is the electron charge, c the speed of light and h Planck's constant. 

There have been a number of recent reports of evidence for a time variation of fundamental 
constants (Webb et al., 2001, 2003; Murphy et al., 2001c; Ivanchik et al., 2003), which we 
review below. Apart from their obvious direct impact if confirmed, they are also crucial in a 
different, indirect way, since they provide us with an important (and possibly even unique) 
opportunity to test a number of fundamental physics models, such as string theory. Indeed 
here the issue is not if such a theory predicts such variations, but at what level it does so, 
and hence if there is any hope of detecting them in the near future, or if we have done it 
already. 
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On the other hand, the theoretical expectation in the simplest, best motivated model is 
that a should be a non-decreasing function of time (Damour & Nordtvedt, 1993; Santiago 
et al., 1998; Barrow et al., 2002). This is based on rather general and simple assumptions, 
in particular that the cosmological dynamics of the fine-structure constant is governed by a 
scalar field whose behavior is akin to that of a dilaton. If this is so, then it is particularly 
important to try to constrain it at earlier epochs, where any variations relative to the present- 
day value should be larger. However, one of the interpretations of the Oklo results is that a 
was larger at an epoch corresponding to a redshift of about z ~ 0.1 than today, whereas the 
quasar results indicate that a was smaller at z ~ 2 — 3 than today, see below for more details. 
If both results are validated by future experiments, then the above theoretical expectation 
must clearly be wrong, which would be a perfect example of using astrophysics to learn about 
fundamental physics. Playing devil's advocate, one could certainly conceive that cosmological 
observations of this kind could one day prove string theory wrong. Indeed, it has been argued 
(Damour, 2003a,b) that even the results of Webb and collaborators may be hard to explain 
in the simplest, best motivated models where the variation of the fine-structure constant is 
driven by the spacetime variation of a very light scalar field. 

Cosmic microwave background anisotropies provide a tool to measure the fine-structure 
constant at high redshift, being mostly sensitive to the epoch of decoupling, z ~ 1100. 

6.3.2 The observational status 

The recent explosion of interest in the study of varying constants is mostly due to the results 
of Webb and collaborators (Murphy et al., 2001b; Webb et al., 2001; Murphy et al., 2001c,a) 
of a 4cj detection of a fine-structure constant that was smaller in the past, 

Act K 

= (-0.72 ±0.18) x 10" 5 , z~ 0.5 -3.5; (6.27) 

a 

indeed, more recent work (Murphy et al., 2003; Webb et al., 2003) provides an even stronger 
detection. These results are obtained through comparisons of various transitions (involving 
various different atoms) in the laboratory and in quasar absorption systems, using the fact 
that the size of the relativistic corrections goes as (aZ) 2 . A number of tests for possible 
systematic effects have been carried out, all of which have been found either not to affect the 
results or to make the detection even stronger if corrected for. 

A somewhat analogous (though simpler) technique uses molecular hydrogen transitions 
in damped Lyman-a systems to measure the ratio of the proton and electron masses, fi = 
m p /m e (using the fact that electron vibro-rotational lines depend on the reduced mass of 
the molecule, and this dependence is different for different transitions). The latest results 
(Ivanchik et al., 2002) using two systems at redshifts z ~ 2.3 and z ~ 3.0 are 

(5.7 ±3.8) x 10" 5 , (6.28) 

(12.5 ± 4.5) x 10" 5 , (6.29) 

depending on which of the (two) available tables of "standard" laboratory wavelengths is 
used. This implies a 1.5cr detection in the more conservative case, though it also casts some 
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doubts on the accuracy of the laboratory results, and on the influence of systematic effects 
in general. 

We should also mention a recent re-analysis (Fujii, 2002) of the well-known Oklo bound 
(Damour & Dyson, 1996). Using new Samarium samples collected deeper underground (aim- 
ing to minimize contamination), these authors again provide two possible results for both a 
and the analogous coupling for the strong nuclear force, a s , 

-~^± = (0.4 ± 0.5) x lO^yr- 1 (6.30) 

a a s 

or 

^ ~ ^ = -(4.4 ± 0.4) x l(r 17 yr _1 • (6.31) 

Note that these are given as rates of variation, and effectively probe timescales corresponding 
to a cosmological redshift of about z ~ 0.1. Unlike the case above, these two values correspond 
to two possible physical branches of the solution. See Fujii (2002) for a discussion of why 
this method yields two solutions (and also note that these results have opposite signs relative 
to previously published ones, Fujii et al., 2000). While the first of these branches provides a 
null result, (6.31) is a strong detection of an a that was larger at z ~ 0.1, that is a relative 
variation that is opposite to Webb's result (6.27). Even though there are some hints (coming 
from the analysis of other Gadolinium samples) that the first branch is preferred, this is by 
no means settled and further analysis is required to verify it. 

Still we can speculate about the possibility that the second branch turns out to be the 
correct one. Indeed this would definitely be the most exciting possibility. While in itself 
this wouldn't contradict Webb's results (since Oklo probes much smaller redshift and the 
suggested magnitude of the variation is smaller than that suggested by the quasar data), it 
would have striking effects on the theoretical modelling of such variations. In fact, proof that 
a was once larger than today's value would sound the death knell for any theory which models 
the varying a through a scalar field whose behaviour is akin to that of a dilaton. Examples 
include Bekenstein's theory (Bekenstein, 1982) or simple variations thereof (Sandvik et al., 
2002; Olive & Pospelov, 2002). Indeed, one can quite easily see (Damour & Nordtvedt, 1993; 
Santiago et al., 1998) that in any such model having sensible cosmological parameters and 
obeying other standard constraints, a must be a monotonically increasing function of time. 
Since these dilatonic-type models are arguably the simplest and best-motivated models for 
varying a from a particle physics point of view, any evidence against them would be extremely 
exciting, since it would point towards the presence of significantly different, yet undiscovered 
physical mechanisms. 

Finally, we also mention that there have been recent proposals (Braxmaier et al., 2001) 
of more accurate laboratory tests of the time independence of a and [i using monolithic 
resonators, which could improve current bounds by an order of magnitude or more. 

However, given that there are both theoretical and experimental reasons to expect that 
any recent variations will be small, it is important to develop tools allowing us to measure 
a in the early universe, as variations with respect to the present value could be much larger 
then. 
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6.3.3 Effects of a on the ionization history 

The reason why the CMB can be used as a probe of variations of the fine-structure constant is 
that these alter the ionization history of the universe. Here we present the dominant effects, 
see Hannestad (1999); Kaplinghat et al. (1999) for a detailed treatment. 

The impact of the fine-structure constant on the CMB comes from the dependence of the 
differential optical depth f (2.117, page 40) on the Thomson scattering cross section, which 
is 

8ira 2 h 2 

aT = ' (6 - 32) 

where we have reintroduced the speed of light c and the Planck constant h, and m e is the elec- 
tron mass. Now the equilibrium electron ionization fraction x| q = n e /nH goes approximately 

as 

x| q oc(-f) exp(-5/T), (6.33) 



T 

where B is the Hydrogen binding energy 

B = a 2 m e c 2 /2 (6.34) 

(see e.g. Kolb & Turner, 1990). If we ignore the fact that x e (z) does not precisely track its 
equilibrium value, and since the exponential factor dominates near recombination, we would 
simply expect from T tx 1/a oc z that the reionization fraction be just a function of z/a 2 . 
This turns out to be approximately correct, even if the effect of the factor (m e /T) 3//2 and 
the departure of x e from xf 1 need to be taken into account for a more precise estimation 
(Kaplinghat et al., 1999). 

In general, around the decoupling epoch relevant for the CMB, the fine-structure constant 
can be expected to evolve with redshift, a = a(z), but we can take a constant value c^dec = 
a(zdec) instead and consider it as an effective value averaged over the recombination process. 
Summarizing, there are two important changes in the reionization history brought about by a 
change in ctdec, the value of a at the recombination epoch, which are best discussed in terms 
of changes on the visibility function g(z), defined in Eq. (3.126, page 67). A larger value of 
«dec with respect to ao, its value today, implies: 

• an increased redshift of last scattering: as estimated above, this follows from rescaling 
the reionization fractions as z/a^ ec , hence decoupling happens earlier for a larger ctdec, 
which means that the sound horizon r s (zd ec ), see Eq. (4.24, page 82), is smaller. As 
a consequence, we expect a shift of the peaks' structure to larger £ values, according 
to (4.17, page 81). This effect will be degenerate with the shift parameter 7£ shlft (4.22, 
page 82) or equivalently with the normal parameter A, Eq. (4.43, page 88), as shown 
in Fig. 6.14. There will also be a boost of the first acoustic peak due to the increased 
early ISW effect, see § 4.1.2.3. 

• A narrower peak of the visibility function: by increasing ct^ec the peak of the visibility 
function is moved to a larger redshift, when the expansion rate is faster 

Toe -Hot -(1 + z) (6.35) 

and thus the temperature and therefore x e drop faster, which makes g(z) narrower, see 
Fig. 6.15. This leads to a smaller damping scale, cf. Eq. (4.34, page 86), hence the 
small-scale power of the CMB spectrum increases for adec/ao > 1- 
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Figure 6.14: Left panel: derivatives of the temperature spectrum with respect to a^ ec and the 
shift parameter 7£ shlft . We plot —dCg/da^ec to facilitate the comparison with dCi/dlZ^ 1 . 
The two derivatives are perfectly in phase: this is responsible for the degeneracy between 
the corresponding parameters (right panel, Fisher matrix analysis). Only the different am- 
plitudes allow an experiment which maps sufficiently high multipoles with high accuracy to 
distinguish between them, in particular revealing the change in the damping scale brought 
about by changes in adec- In the right panel, the Fisher matrix results contain la of the 
likelihood (including temperature only), and clearly indicate a strong correlation between 
the two parameters (see Martins et ah, 2002). 

In Fig. 6.16 we plot the resulting CMB temperature spectrum, where the above mentioned 
changes are readily distinguishable. 

6.3.4 The role of reionization 

After decoupling, the CMB is essentially insensitive to how a varies, until the reionization 
epoch is reached, at which point Thomson scattering becomes effective again. If the value of a 
at reionization, a re = a(z re ), is different from its value today, it will affect the CMB spectrum 
through a change in the reionization optical depth r re . However, r re is itself dependent on the 
cosmological model and possibly on a number of relevant non-linear physical processes related 
to the astrophysical mechanisms responsible for the reionization. In general, this problem is 
solved by treating free parameter, which accounts for the relatively poor knowledge 

of the details of the reionization history and in our case for the uncertainty about the exact 
value of a during the reionization epoch. We conclude that provided we treat Tj-g ELS cl free 
parameter the lack of a precise knowledge of the value of a during the epoch of reionization 
is unimportant, and we can take a re = a®. On the more phenomenological side, the results 
of Webb and collaborators for the value of a at a redshift of 2 — 3 would suggest that at the 
epoch of reionization the possible changes in a relative to the present day are already very 
small. Therefore one can calculate the effect of a varying a by simply assuming two values 
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Figure 6.15: Ionization fraction as a function of redshift (left panel) and visibility function as 
a function of conformal time (right panel) for different values of the fine-structure constant 
at decoupling: Odec/^o = 1 (solid), adec/ a o = 1-03 (dotted), o^ec 

/a = 0.97 (dashed). 
Decoupling happens earlier and the last scattering surface is narrower for «dec/ao > 1- 

for the fine-structure constant, one at low redshift, z<20, for which we take today's value 
by the above argument, and one around the epoch of decoupling, adeo which we want to 
determine. 

As shown in § 4.1.3.2, reionization changes the amplitude of the acoustic peaks in the 
temperature spectrum, without affecting their position and spacing, while introducing the 
reionization bump at low I in the polarization spectrum. If the value of adec is different from 
the value today (which corresponds to a re ), then the peaks in the polarization power spectrum 
at small angular scales will be shifted sideways, while the reionization bump on large angular 
scales will remain fixed. This is illustrated in Fig. 6.16 (lower right panel). It follows that by 
measuring the separation between the acoustic peaks and the bump, one could in principle 
measure both a and the reionization optical depth r re , as shown in Fig. 6.17. This holds 
true as long as one assumes a specific reionization history, such as the sudden reionization 
scenario used here. However, if we would allow for a more realistic reionization modelling, 
the detailed dependence of the reionization bump on the new reionization parameters is likely 
to wash out this effect. Nevertheless, with present-day accuracy the CMB data are sensitive 
only to the optical depth of reionization, as pointed out in § 6.2.2, which justify the use of 
the simplest reionization modelling. Within this framework, the fact that r re unexpectedly 
turned out to be as large as 0.16 as derived from the WMAP data (Spergel et al., 2003) makes 
the prospects of constraining a with the CMB much better because of the above effect. 

Finally, we point out that the modifications discussed above are direct consequences of an 
a variation, and that indirect effects are usually present as well since any variation of a is 
necessarily coupled with the dynamics of the Universe (Mota & Barrow, 2004). Here we take 
a pragmatic approach and say that, since the CMB is insensitive to the details of a variations 
from decoupling to the present day, we do not in fact need to specify a redshift dependence 
for this variation - although we could have specified one if we so chose. At this stage, 
we prefer to focus on model-independent constraints, and hence do not attempt to include 
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Figure 6.16: Contrasting the effects of varying ctdec (left) and reionization optical depth r re 
(right) on the CMB temperature (top) and polarization (bottom). The reionization bump 
is not changed by variations of adec/^o- The black lines are for the WMAP best fit model, 
with adec/^o = 1 and r re = 0.17. 



an explicit modelling for the redshift dependence a(z). Nevertheless, given some model- 
independent constraints one can always translate them into constraints on the parameters 
of one's favorite model. Beside possible time variations of a, investigated here, one could 
also envisage searching for spatial variations on the last scattering surface (Sigurdson et al., 
2003). 



6.3.5 CMB constraints on a from WMAP alone 

We use a modified version of CMBFAST which includes the effects of varying a described 
above, to analyse the recent WMAP temperature and cross-polarization data adopting the 
likelihood estimator method described in Verde et al. (2003). The models are sampled on an 
uniform grid in a 7 dimensional parameter space as follows: 

0.05 < n c h 2 < 0.20 (0.01) , 

0.010 < n b h 2 < 0.028 (0.001) , 

0.500 <n A < 0.950 (0.025) , 

0.900 < a d ec/ao< 1-050 (0.005) , (6.36) 

0.06 < r re < 0.30 (0.02) , 

0.880 < n s < 1.08 (0.005) , 

-0.15 <-J^V < 0.05 (0.01). 
dm A; 

The numbers between parentheses give the step size along each direction; n s is the scalar 
spectral index of the primordial power spectrum, and dn s /dln k is the spectral index running, 
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Figure 6.17: The separation in I between the reionization bump and the first (solid lines), 
second (dashed) and third (dotted) peaks in the polarization spectrum, as a function of a 
at decoupling and r. A (somewhat idealized) description of how a and r re can be measured 
using CMB polarization. 

i.e. we introduce a scale dependence of the spectral index of the form 

».<*)=«.(*) + 5^ ln(|-), (6.37) 

where n s = n s (kp) is a constant and the pivot scale kp is chosen to be kp = 0.002Mpc _1 . 
We only include flat models, so that the Hubble parameter Hq = lOO/i km s _1 Mpc -1 is a 
derived quantity. We don't consider gravity waves or isocurvature modes since these further 
modifications are not required by the WMAP data. 

The likelihood distribution function for o^ec/ao, obtained after marginalization over the 
remaining parameters, see § 5.1.5, is plotted in Fig. 6.18, and gives the marginalized confi- 
dence interval 

0.95 < adecM) < 1-02 (at 95% I.e.). (6.38) 
If we impose dn s /dln£; = we obtain instead 

0.94 < a dec /a < 1.01 (at 95% I.e.). (6.39) 

It is interesting to consider the correlations between a a/ao and the other parameters in 
order to see how this modification to the standard model can change our conclusions about 
cosmology. In Fig. 6.19 we plot the likelihood contours in the a/ao —T re plane for two cases: 
using the temperature only WMAP data and including the TE cross correlation data. There 
is a clear degeneracy between these two parameters if one uses only temperature information: 
increasing the optical depth allows for an higher value of the spectral index n$ and a lower 
value of a/ao. Inclusion of the TE data is already able to partially break this degeneracy, 
but, as we explain below, more detailed measurements of the polarization spectra are needed 
to constraint separately the two parameters, 
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Figure 6.18: Marginalized likelihood distribution function for variations in the fine-structure 
constant at the time of decoupling obtained by an analysis of the WMAP data (TT+TE, 
one- year) . 

One of the most unexpected results from the WMAP data is the hint for a scale-dependence 
of the spectral index n s (see e.g. Peiris et al., 2003; Kinney et al., 2004). Such a dependence 
should not be detectable in most of the viable single field inflationary models and, if con- 
firmed, would have strong consequences on the possibilities of reconstructing the inflationary 
potential. For this reason we included the running of the spectral index in our parameter 
set. In Fig. 6.20 we plot likelihood contours in the a/ao — dn s /dlnk plane, showing that a 
lower value of a/ao would prefer the absence of running. As already pointed out in Bean 
et al. (2003), a modification of the recombination scheme can therefore provide a possible 
explanation for the large value of dn s /dln/c found from WMAP data. 

In previous (pre- WMAP) work, CMB-based constraints on a were obtained with the help 
of additional cosmological data-sets and priors, as in Martins et al. (2002). This procedure 
was exposed to the criticism that different data-sets could possibly have different systematic 
errors that are impossible to control and could conceivably conspire to produce the results 
quoted. The above results are obtain from WMAP only, and therefore eliminate this possible 
uncertainty. For earlier works and pre- WMAP constraints, see also Avelino et al. (2000, 
2001); Battye et al. (2001); Hannestad (1999). 

6.3.6 Fisher matrix forecasts and degeneracies 

We apply the Fisher matrix analysis (FMA) technique explained in § 5.2 to the problem of 
forecasting the expected precision in the determination of a^ec with CMB anisotropy. For 
the accuracy reasons presented at length in § 5.2, § 6.1.5 and § 6.2.5, we choose to employ 
the following 8 dimensional base parameter set 

6 = {n b h 2 ,n m h 2 ,n A h 2 ,n shm ,n s ,Q,r re ,a dec /a } (6.40) 

which takes into account the severe geometrical degeneracy via the shift parameter 7£ shlft , 
defined in Eq. (4.22). The quantity n s is the scalar spectral index (without running) and Q 
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Figure 6.19: Likelihood contour plot in the a^ec/^o — r re plane including temperature infor- 
mation only (TT) and TT+TE together from WMAP (68% and 95% I.e. from the inside 
out). The inclusion of polarization data partially breaks the degeneracy between these two 
parameters. 



a phenomenological normalization parameter as in (6.13, page 131). We restrict ourselves to 
scalar modes and adiabatic initial conditions. 

The maximum likelihood model around which the FMA for Planck and the CVL is 
performed has parameters Uf, = 0.0200, uj m = 0.1310, uj\ = 0.2957 (and h = 0.65), 
TZ shiit = 0.9815, n s = 1.00, Q = 1.00, r = 0.20 and a/a = 1.00. We differentiate around 
a slightly closed model (as preferred by WMAP) with £l tot = 1.01 to avoid extra sources 
of numerical inaccuracies, since open and closed models are computed by cmbfast using 
different numerical techniques which would introduce unwanted inaccuracies. 

Regarding numerical accuracy issues in the computation of the Fisher matrix, we imple- 
ment in the present work double-sided derivatives, which reduce the truncation error from 
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Table 6.6: Experimental parameters for WMAP and Planck (nominal mission). Note that 
we express the sensitivities in fj,K. See § 5.2.1 for definitions. 
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Figure 6.20: Likelihood contour plot in the adec/c^o — dn s /dlnA; plane, from WMAP tem- 
perature and ET correlation data (68%, 95% and 99% I.e. from the inside out). A zero scale 
dependence, as expected in most of the inflationary models, seems to be more consistent with 
a value of adec/ a o < 1- 

second order to third order terms. The choice of the step size is a trade-off between truncation 
error and numerical inaccuracy dominated cases. For an estimated numerical precision of the 
computed models of order 10~ 4 , the step size should be approximately 5% of the parameter 
value (Press et al., 1992), though it turns out that for derivatives in direction of a and n s 
the step size can be chosen to be as small as 0.1%. After several tests, we have chosen step 
sizes varying from 1% to 5% for a;;,, w m , oja and lZ shlit . This choice gives derivatives with 
an accuracy of about 0.5%. The derivatives with respect to Q are exact, being the power 
spectrum itself. 




Predictions for WMAP's four year data 

We present here the main results of the Fisher matrix forecasts; the full tables and more 
detailed comments can be found in Rocha et al. (2004). We first concentrate on the potential 
of the WMAP four year data, and we compare in Tables 6.7 and 6.8 the expected errors for 
two cases, for the base set of parameters (6.40) with and without inclusion of ad ec / a o- In 
both cases, we take as reference model for the Fisher matrix the WMAP best fit model of 
Table 1, in Spergel et al. (2003), but with a slightly larger cosmological constant which gives 
fitot = 1-01, for the accuracy reasons explained above. 

Table 6.7 gives accurate predictions for the errors on standard cosmological parameters, for 
models including non-flat cosmologies. Clearly, with the WMAP sensitivity, E-polarization 
alone will not constrain much the parameters, but combining temperature information with 
the polarization channels will reduce the errors on the baryon and matter density and on the 
shift parameter by about a factor of three, with all other parameters marginalized over. The 
error on the cosmological constant will remain of order unity, since this is an expression of 
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324.06 


spectral index 


n s 


69.43 


4.89 


198.58 


6.68 


0.53 


19.11 


normalization 


Q 


79.22 


13.51 


226.58 


0.90 


0.32 


2.58 


shift parameter 


-j^shift 


46.52 


13.04 


133.06 


9.25 


0.59 


26.47 


reionization optical depth 




100.84 


8.21 


288.40 


102.72 


16.70 


293.79 






Temp+Pol (TT+EE) 


All (TT+EE+TE) 


baryon density 


U b 


2.14 


0.80 


6.11 


2.13 


0.80 


6.08 


matter density 




3.09 


0.77 


8.85 


3.08 


0.77 


8.81 


A density 




90.70 


63.84 


259.41 


86.97 


62.69 


248.75 


spectral index 


n s 


1.46 


0.52 


4.18 


1.45 


0.52 


4.15 


normalization 


Q 


0.52 


0.32 


1.48 


0.52 


0.32 


1.48 


shift parameter 


-j^shift 


2.86 


0.59 


8.17 


2.84 


0.59 


8.12 


reionization optical depth 


''"re 


10.52 


7.45 


30.08 


10.41 


7.44 


29.78 



Table 6.7: Fisher matrix analysis results for a standard model with inclusion of reionization 
(for the WMAP best fit model as the fisher analysis fiducial model, with r re = 0.17): expected 
la errors from the WMAP-four year data. The column marg. gives the error with all other 
parameters being marginalized over; in the column fixed the other parameters are held fixed 
at their ML value; in the column joint all parameters are being estimated jointly. 



the geometrical degeneracy which is fundamentally unbreakable without external priors. The 
spectacular improvement of about a factor 10 in determining r re with polarization informa- 
tion is a consequence of the expected measurement of the reionization induced polarization 
bump, which breaks the degeneracy with normalization present with temperature alone. The 
spectral index accuracy thus increases by a factor 4, because the better determination of 
the reionization optical depth assists into breaking the small scale degeneracy with n s . The 
column "fixed" gives the best case scenario in which all other parameters are assumed to be 
known and fixed to their fiducial model value. In this case, the errors obtained by combining 
all channels are below 1% for all parameters but the cosmological constant. 

Let us now compare this forecasts with the corresponding entries in Table 6.8, where the 
parameter a^c/otQ has been added. The addition of a varying fine-structure constant opens 
up new degeneracy directions, hence the marginalized and joint error forecasts get worse (but 
not the errors with all other parameters fixed, of course). The most degenerate direction is 
with the shift parameter (marginalized errors larger by a factor 7 with all channels), as 
expected from the above considerations. Due to its effect on the peak heights, the fine- 
structure constant is largely degenerate with ujb up to the second acoustic peak; an accurate 
mapping of the large multipole temperature spectrum can nevertheless lift this degeneracy, 
also constraining better ra s , see Martins et al. (2002) for details. This explains the larger 
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Quantity la errors (%) 

WMAP four year 







marg. 


fixed 


joint 


marg. 


fixed 


joint 






Polarization (EE) 


Temperature 


(TT) 


baryon density 


LO b 


173.74 


16.58 


496.91 


14.09 


0.81 


40.30 


matter density 


U m 


260.62 


17.16 


745.40 


13.76 


0.77 


39.36 


A density 


WA 


637.28 


97.58 


1822.66 


133.73 


83.39 


382.47 


spectral index 


n s 


108.18 


4.89 


309.41 


7.86 


0.53 


22.47 


normalization 


Q 


96.60 


13.51 


276.30 


2.33 


0.32 


6.67 


shift parameter 




133.23 


13.04 


381.04 


26.29 


0.59 


75.19 


fine structure constant 


C^dec 


69.10 


2.48 


197.62 


5.83 


0.12 


16.66 


reionization optical depth 


7~re 


228.69 


8.21 


654.07 


103.86 


16.70 


297.05 






Temp+Pol (TT+EE) 


All (TT+EE+TE) 


baryon density 


U)b 


7.50 


0.80 


21.44 


7.41 


0.80 


21.18 


matter density 




5.48 


0.77 


15.66 


5.46 


0.77 


15.62 


A density 




91.57 


63.84 


261.91 


87.48 


62.69 


250.20 


spectral index 


n s 


2.03 


0.52 


5.82 


2.03 


0.52 


5.81 


normalization 


Q 


1.31 


0.32 


3.73 


1.30 


0.32 


3.71 


shift parameter 




14.34 


0.59 


41.01 


14.17 


0.59 


40.53 


fine structure constant 


C^dec 


3.08 


0.11 


8.80 


3.05 


0.11 


8.71 


reionization optical depth 


''"re 


10.65 


7.45 


30.46 


10.52 


7.44 


30.08 



Table 6.8: Fisher matrix analysis results for the model of Table 6.7 with inclusion of «dec- 



errors on the baryon density and on the spectral index as we include a in the parameter set. 
However, the optical depth determination remains almost unaffected, as a consequence of 
the simultaneous measurement of the reionization bump's position and of the acoustic peaks 
angular scale, thereby validating our method for the restricted class of sudden reionization 
models considered here. 

Predictions for Planck and an ideal experiment 

We now focus on the Fisher matrix forecasts for the expected performance of the Planck 
satellite, and compare them with the results for an ideal CMB experiment, which would 
map both temperature and E-polarization with cosmic variance limited (CVL) accuracy up 
to i = 2000. Clearly, such a measurement is not feasible in practice, because of foreground 
removal and limited instrumental sensitivity, but it represents in principle the best possible 
parameters determination using CMB alone. The full results are tabulated in Table 6.9 and 
Table 6.10. In order to clarify the role of correlations between parameters, we plot in Figures 
6.21 and 6.23 the 2a joint likelihood contours for all couples of parameters for Planck, and 
in Figures 6.22 and 6.24 for the CVL experiment. 

The first important fact is that E-polarization data alone from Planck will constrain the 
standard parameters better than the four year WMAP temperature data alone, compare 
Table 6.7 with Table 6.9. This follows from the fact that the polarization spectrum is less 
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plagued by large scale degeneracies than the temperature spectrum. Furthermore, as apparent 
from Fig. 6.21, degeneracy directions for the temperature spectrum are in many cases almost 
orthogonal to the directions in the polarization channel. This is especially the case for r re , and 
in fact combining temperature and polarization information reduces its marginalized error 
from 16% (6%) for temperature (polarization) alone to 4%. In general, the WMAP four year 
error-bars will be approximately halved for all parameters by Planck. Another significant 
aspect is that by comparing the temperature only column for Planck to the one for the CVL 
experiment, we conclude that Planck will be essentially cosmic variance limited as far as the 
temperature spectrum is concerned. This is not the case for the polarization channel, for 
which there will still be room for a substantial improvement over Planck's capabilities: the 
CVL experiment can do better than Planck by a factor 5 or more on average. The comparison 
of Figures 6.21 and 6.22 immediately confirms this conclusion, which makes a strong case for 
a post-Planck, polarization-dedicated experiment. 

When we add the fine-structure constant to the Planck parameter set, the ellipses for tem- 
perature and polarization get larger for all the couples of parameters involving degenerate 
directions with a, compare Fig. 6.23 with Fig. 6.21. As before, this happens mostly for the 
TZ shii \ n s and r re using temperature information only. The degradation of the accuracy on 
those parameters is less dramatic than for WMAP, because Planck will map the spectrum 
to larger multipoles. It is remarkable that the combined temperature and polarization error 
does not grow very much when we add a, because the degeneracies are in different direc- 
tions for the two channels. The fine-structure constant is the only parameter which Planck 
will constrain better with temperature only (0.7%) than with polarization only (2.7%, all 
others marginalized), while the situation is opposite for r re , 27% for temperature and 9% 
for polarization. Combining the two channels again lifts most of the degenerate directions, 
and we conclude that Planck will achieve an accuracy on ad ec of order 0.3% (la, all others 
marginalized), thus improving by about a factor of 10 on the expected performance of the 
four year WMAP mission and a factor of 5 on the current upper bound (obtained however 
under the assumption of flatness). At the same time, the reionization optical depth will be 
constrained to about 4.5%. Our findings for adec/^o and r re are summarized in Fig. 6.25, 
where we compare degeneracy directions in the adec/^O) r re plane for temperature alone, po- 
larization alone and the combined channels, for Planck and the CVL experiment. We also 
superimpose the corresponding forecast for the WMAP four year mission (all channels) in 
order to facilitate the comparison. 

The columns in Table 6.10 regarding the CVL experiment and the corresponding Fig. 6.24 
give information about further improvements on Planck's parameter accuracy. As mentioned, 
a cosmic variance limited measurement of polarization could further reduce Planck's error- 
bars by a factor 2 to 3, reaching the highest possible accuracy from CMB alone. In particular, 
our analysis indicate that CMB alone can constrain variations of a up to 0(1O -3 ) at z ~ 1100. 
Going beyond will require additional priors on the other parameters. 
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Quantity la errors (%) 

Planck HFI CVL 
marg. fixed joint marg. fixed joint 



Polarization only (EE) 



baryon density 


UJ b 


6.21 


1.11 


17.75 


0.48 


0.25 


1.38 


matter density 




3.37 


0.39 


9.64 


0.70 


0.03 


1.99 


cosmological constant density 




37.37 


22.87 


106.89 


11.40 


9.99 


32.61 


spectral index 


n B 


1.53 


0.96 


4.38 


0.30 


0.08 


0.86 


normalization 


Q 


2.23 


0.51 


6.38 


0.24 


0.07 


0.67 


shift parameter 


•^shift 


3.33 


0.35 


9.52 


0.65 


0.03 


1.86 


reionization optical depth 




5.74 


2.78 


16.42 


1.81 


1.52 


5.18 








Temperature only ( 


TT) 




baryon density 


UJ b 


0.86 


0.60 


2.46 


0.57 


0.38 


1.64 


matter density 




1.51 


0.13 


4.31 


1.10 


0.08 


3.14 


cosmological constant density 




110.15 


96.15 


315.03 


98.15 


86.00 


280.72 


spectral index 


n s 


0.54 


0.13 


1.56 


0.36 


0.07 


1.04 


normalization 


Q 


0.20 


0.11 


0.56 


0.17 


0.07 


0.50 


shift parameter 


-j^shift 


1.47 


0.12 


4.21 


1.05 


0.07 


3.01 


reionization optical depth 


7~re 


16.50 


8.28 


47.20 


14.02 


5.89 


40.09 






Temperature 


and Polarization (TT+ 


EE) 


baryon density 


UJ b 


0.80 


0.53 


2.30 


0.32 


0.21 


0.92 


matter density 




1.24 


0.12 


3.55 


0.55 


0.03 


1.58 


cosmological constant density 




30.58 


22.04 


87.46 


10.72 


9.85 


30.65 


spectral index 


n B 


0.43 


0.13 


1.23 


0.20 


0.05 


0.58 


normalization 


Q 


0.19 


0.10 


0.53 


0.14 


0.05 


0.41 


shift parameter 


-j^shift 


1.22 


0.11 


3.48 


0.52 


0.03 


1.49 


reionization optical depth 




4.04 


2.65 


11.56 


1.73 


1.48 


4.96 



Table 6.9: Fisher matrix analysis results including reionization (r re = 0.20): expected la 
errors for the Planck satellite and for cosmic variance limited (CVL) experiment. 
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Figure 6.21: Ellipses containing 95.4% (2a) of joint confidence (all other parameters marginal- 
ized) using temperature alone (red), E-polarization alone (yellow), and both jointly (white), 
for a standard model with inclusion of reionization (r re = 0.20). Fisher matrix forecast for 
the Planck HFI instrument. 
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Figure 6.22: Ellipses containing 95.4% (2a) of joint confidence (all other parameters marginal- 
ized) using temperature alone (red), E-polarization alone (yellow), and both jointly (white), 
for a standard model with inclusion of reionization (r re = 0.20). Fisher matrix forecast for 
an ideal cosmic variance limited (CVL) experiment. 
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Quantity lcr errors (%) 

Planck HFI CVL 
marg. fixed joint marg. fixed joint 



Polarization only (EE) 



baryon density 




6.46 


1.11 


18.47 


1.09 


0.25 


3.12 


matter density 




7.75 


0.39 


22.17 


1.61 


0.03 


4.60 


cosmological constant density 




41.61 


22.87 


119.01 


11.60 


9.99 


33.17 


spectral index 


n s 


4.14 


0.96 


11.85 


0.77 


0.08 


2.22 


normalization 


Q 


2.99 


0.51 


8.55 


0.24 


0.07 


0.68 


shift parameter 


•^shift 


9.56 


0.35 


27.33 


1.19 


0.03 


3.40 


fine structure constant 


C^dec 


2.66 


0.06 


7.62 


0.40 


< 0.01 


1.14 


reionization optical depth 


7"re 


8.81 


2.78 


25.19 


2.26 


1.52 


6.45 








Temperature only ( 


TT) 




baryon density 


U) b 


1.09 


0.60 


3.12 


0.83 


0.38 


2.37 


matter density 




3.76 


0.13 


10.74 


2.64 


0.08 


7.55 


cosmological constant density 




111.61 


96.15 


319.21 


98.97 


86.00 


283.05 


spectral index 


n s 


2.18 


0.13 


6.24 


1.49 


0.07 


4.26 


normalization 


Q 


0.20 


0.11 


0.57 


0.18 


0.07 


0.50 


shift parameter 


•^shift 


1.58 


0.12 


4.53 


1.06 


0.07 


3.04 


fine structure constant 


C^dec 


0.66 


0.02 


1.88 


0.41 


0.01 


1.18 


reionization optical depth 


7~re 


26.93 


8.28 


77.02 


20.32 


5.89 


58.11 


Temperature and Polarization (TT+EE) 


baryon density 


U b 


0.91 


0.53 


2.61 


0.38 


0.21 


1.09 


matter density 




1.81 


0.12 


5.17 


0.67 


0.03 


1.91 


cosmological constant density 




30.89 


22.04 


88.36 


10.79 


9.85 


30.85 


spectral index 




0.97 


0.13 


2.77 


0.33 


0.05 


0.93 


normalization 


Q 


0.19 


0.10 


0.54 


0.14 


0.05 


0.41 


shift parameter 


•^shift 


1.43 


0.11 


4.08 


0.60 


0.03 


1.72 


fine structure constant 


C^dec 


0.34 


0.02 


0.97 


0.11 


< 0.01 


0.32 


reionization optical depth 




4.48 


2.65 


12.80 


1.80 


1.48 


5.15 



Table 6.10: Fisher matrix analysis results as in Table 6.9 but including «dec- 



170 



Beyond standard parameters 



3 



Planck 



2crc.l. 




Figure 6.23: Ellipses containing 95.4% (2a) of joint confidence (all other parameters marginal- 
ized) using temperature alone (red), E-polarization alone (yellow), and both jointly (white), 
for a standard model with inclusion of reionization (r re = 0.20) and time variations of the 
fine-structure constant. Fisher matrix forecast for the Planck HFI instrument. 
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Figure 6.24: Ellipses containing 95.4% (2a) of joint confidence (all other parameters marginal- 
ized) using temperature alone (red), E-polarization alone (yellow), and both jointly (white), 
for a standard model with inclusion of reionization (r re = 0.20) and time variations of the 
fine-structure constant. Fisher matrix forecast for an ideal cosmic variance limited (CVL) 
experiment. 
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Figure 6.25: Ellipses containing 95.4% (2a) of joint likelihood in the adec/«o — Tre plane (all 
other parameters marginalized), for the Planck and cosmic variance limited (CVL) experi- 
ments, using temperature alone (red), E-polarization alone (yellow), and both jointly (white). 
The dashed contour represents the WMAP - 4years forecast using (TT+EE+TE) jointly. 



Chapter 7 

Testing the paradigm of adiabaticity 



Combination of today's high quality CMB data with other cosmological data sets allows us 
to constrain the eight parameters 

9 = {^cdm, tt b , Q A , N u , h, T Te ,n s , A s } (7.1) 

with an accuracy of a few percent (Tegmark et al., 2004b), if we assume flatness, i.e. by 
imposing Qjc = 0. This is a spectacular achievement, even more so given the fact that many 
completely independent measurements seem to be converging towards the same values. In 
the previous sections we have discussed the determination of most of the above parameters; 
here we highlight that the accuracy of parameter extraction depends crucially on the assump- 
tion that the initial conditions for the perturbations are purely adiabatic, and explore the 
consequences of relaxing this strong assumption by including the most general type of initial 
conditions in the problem. 

This chapter is organized as follows: we first present an introductory survey on recent 
CMB analysis involving isocurvature modes, § 7.1; we then investigate in a specific example 
how the inclusion of isocurvature modes spoils the precise determination of the baryon density 
from pre- WMAP CMB data in § 7.2; in § 7.3 we ask whether the presence of non-adiabatic 
contribution can reproduce CMB and large scale structure observations without the need for a 
cosmological constant, and we conclude that ft\ ^ is robust with respect to the inclusion of 
isocurvature modes and to the use of a frequentist (rather than Bayesian) approach; finally, in 
§ 7.4 we give the future prospects for the determination by WMAP and Planck of cosmological 
parameters independent of any assumption about the type of initial conditions. 

7.1 Introductory survey 

Until recently, most of the literature has focused on parameter extraction assuming purely 
adiabatic initial conditions, because the evidence for a first acoustic peak around I « 220 
very soon ruled out the possibility of the simplest alternative, namely purely isocurvature 
CDM initial conditions, see e.g. Enqvist et al. (2000). Nevertheless, subdominant CDM 
isocurvature contributions cannot be excluded, and the constraints are even less stringent 
if one allows for a correlated mixture, in which case the correlator can cancel out most 
of the isocurvature contribution on large scale (Langlois & Riazuelo, 2000; Amendola et al., 
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2002). This qualitative conclusion holds even after the more precise measurements of WMAP 
(Valiviita & Muhonen, 2003). 

In the works of Bucher et al. (2001, 2002) the consequences for parameter extraction are 
examined when the most general initial conditions are allowed, with the conclusion that only 
a precise measurement of polarization would allow for the simultaneous reconstruction of 
cosmological parameters and of the initial conditions correlation matrix. The first attempt of 
including all the modes in a numerical parameter determination from real data is performed 
in Trotta et al. (2001), as illustrated in § 7.2, with the result that the pre- WMAP CMB data 
can not constrain to any extent the value of the baryon density and the Hubble parameter 
in the general initial conditions case. After the release of the WMAP first-year data, two 
groups have re-investigated the question of the most general initial conditions in the wake 
of the improved measurements: Crotty et al. (2003a) consider a correlated mixture of the 
adiabatic mode with each of the isocurvature modes in turn, finding that the pre- WMAP 
constraints on the isocurvature contribution are significantly improved; Bucher et al. (2004) 
refine the analysis of Trotta et al. (2001) by using Monte Carlo methods, and simultaneously 
including all the isocurvature modes and six cosmological parameters, but the conclusions 
remained qualitatively the same. The bottom line is that the relaxing the assumption of 
adiabaticity spoils our ability to do precision cosmology. 

The phenomenological approach gives useful hints on the "stiffness" of current data, and 
indeed the possibility of accommodating isocurvature modes has been considerably reduced 
by WMAP. Although independent of any model for the generation of perturbations, this 
approach has the disadvantage of introducing many new free parameters in the description of 
the power spectrum. To reduce this number somewhat, all analyses so far have assumed the 
same spectral index for all modes, an assumption which is not really motivated. Since the 
current CMB data are in excellent agreement with purely adiabatic initial conditions, it is not 
surprising however that there is no statistical evidence that such extra parameters should be 
non-zero. Occam's razor would therefore dictate to stick to the simplest adiabatic description, 
lacking any evidence for a more complicated model. However, there is no compelling reason 
why the physics of the early universe should boil down to only one degree of freedom. 

A second reason why model-independent constraints should be regarded with care is that 
in any specific implementation, some of the parameters will be correlated. For instance, in 
the curvaton scenario (Moroi & Takahashi, 2001; Lyth & Wands, 2002; Enqvist & Sloth, 
2002; Lyth et al., 2003), the adiabatic and residual isocurvature modes are always totally 
correlated or anti-correlated. Therefore, not only the number of extra degrees of freedom is 
reduced, but possibly the parameter space of the model is a highly constrained subspace of the 
model-independent parameter space. For this reason it is interesting to derive model-specific 
constraints, which are more stringent than those obtained with a general phenomenological 
parametrization. For instance, WMAP constraints for the curvaton model have been derived 
for the case of CDM and baryons isocurvature fluctuations (Gordon & Lewis, 2003; Lyth 
k, Wands, 2003). The neutrino density mode can be generated from perturbations of the 
neutrino chemical potential (Lyth et al., 2003), and bounds have recently been derived for 
this case (Gordon &: Malik, 2004). It seems more difficult to produce a neutrino velocity 
mode: a working model is at present still lacking. 

Despite these difficulties, the CMB represents the most promising data set to learn about 
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the type of initial conditions realized in the observed Universe: it is our window to the very 
early universe. 

7.2 Precision cosmology and general initial conditions 

In this section, based on the work published in Trotta et al. (2001), we investigate the extent 
to which the determination of cosmological parameters depends on the assumptions about 
initial conditions. We show in a specific example how the allowed parameter range is enlarged 
when the usual requirement for purely adiabatic initial conditions is relaxed. In order to limit 
the computational effort, we have chosen to vary some cosmological parameters and keep the 
others fixed. We consider flat models only, and we fix the total density parameter, the total 
matter density and the cosmological constant density parameter as follows: 

^tot = ^A + = 1 j 
$7 m = O c dm + = 0.3 , (7.2) 

n A = 0.7, 

where f2 c dm and fib are the density parameters of cold dark matter (CDM) and baryons 
respectively, and fiA denotes the density parameter due to a cosmological constant, 17a = 
A/3Hq, and Hq = lOO/i km s _1 Mpc -1 is the Hubble parameter today. With fiA fixed to 
the above values, we then vary the Hubble parameter h, the baryon density = fib/i 2 
and the correlation matrix M which describes the most general (i.e. mixed adiabatic and 
isocurvature) initial conditions, as explained in § 4.3. We also fix to unity the scalar spectral 
index, n s = 1 for all modes and cross-correlators. Even by varying only two cosmological 
parameters, our parameters space is still 12-dimensional, since the initial condition correlation 
matrix introduces ten free amplitudes. 

We also investigate the following question: what is the preferred isocurvature contribution 
to the perturbations? We shall see that, with pre-WMAP CMB data, this question cannot 
be answered without strong assumptions about the cosmological parameters. 

7.2.1 Pre-WMAP data analysis 

Our analysis uses the COBE (Tegmark & Hamilton, 1997) and BOOMERanG (Netterfield 
et al., 2002) data. For the latter, we take into account the calibration and the beam size 
uncertainties which treated just like two additional (normally distributed) parameters of the 
problem ("nuisance parameters"). The two cosmological parameters h, are sampled on a 
uniform grid as follows (the number in parenthesis is the step size): 

0.50 < h < 0.80 (0.05) , (7.3) 
0.015 < to h < 0.085 (0.005) . (7.4) 

For each grid point, we search the initial condition space by minimizing the chi-square, as 
explained in § 5.1.5. We look for the best fit point by using a downhill simplex method (Press 
et al., 1992) initiated after choosing a starting point randomly. The positive semi-definiteness 
of the correlation matrix M is ensured by penalty functions which guarantee that the con- 
ditions (4.52, page 95) are satisfied (more details are given in Trotta, 2001). The best fit is 
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Figure 7.1: CMB anisotropy temperature spectrum for different values of the cosmological 
parameters Wb and h. We plot the best-fit corresponding to a purely adiabatic case (dashed 
line) and allowing general initial conditions, mixed models (solid line). The calibration and 
the beam size of the BOOMERanG data have been optimized to fit the mixed model (solid 
error bars) or the adiabatic model (dotted error bars). The parameter choice in the left panel 
(o»b = 0.02, h = 0.65) can be fitted by both models while the values uj^ = 0.042, h = 0.65 
(right panel), can only be fitted by a mixed model. 

then estimated after 15, 000 minimization runs using this procedure. It turns out that the 
topology of the x 2 surface on our 14-dimensional parameter space (including the two above 
nuisance parameters) is quite complicated with many local minima and large degeneracies, 
which considerably complicates the numerical search. We assume that the likelihood function 
is Gaussian, and we maximize instead of marginalize over the parameter we are not interested 
in, see § 5.1.5. 

In Fig. 7.1 we show the best-fit spectra for two different choices of the cosmological pa- 
rameters Wb and h. Both of them are good fits if we allow for mixed initial conditions. 
On the plot we have also indicated the reduced x 2 , he. the value of X 2 /F, where F is the 
number of degrees of freedom of the fit. For a fixed choice of Wb, h the purely adiabatic 
model has only three parameters (the amplitude of the adiabatic mode, and the two nuisance 
parameters). With 26 data points (7 from COBE and 19 from BOOMERanG) this leads to 
Fad = 26 — 3 = 23 degrees of freedom. The mixed models have a symmetric 4x4 matrix 
determining the initial amplitude, leading to a total of 12 parameters and hence Fmix = 14 
degrees of freedom. If we also vary Wb and h, the number of degrees of freedom is lowered by 
two. It is not surprising that for fixed values h = 0.65, u>b = 0.02, which are well fitted by 
the adiabatic model, the reduced x 2 of the adiabatic model is somewhat lower than the one 
of the mixed model, since -Fmix < Fad (as an example, see top panel of Fig. 7.1). For the 
mixed model, the absolute x 2 is always lower. 

For both models we determine the likelihood functions of the cosmological parameters Wb 
and h by maximizing the initial conditions correlation matrix and the nuisance parameters. 
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Figure 7.2: Left panel: the contours of 68%, 95%, 99% likelihood content in the (wb, h) plane 
for purely adiabatic models (shadows of green, smaller contours) and for mixed models (red to 
yellow, large contours). The likelihoods are obtained by maximizing the nuisance parameters, 
and the initial conditions correlation matrix M for mixed (i.e. general isocurvature) models. 
For mixed models, the lowest x 2 corresponds to even higher values of u>b and h than those 
shown in the plot. Right panel: the isocurvature content 7 defined in (7.5) of the best fit mixed 
model as function of the parameters (uj^, h). A larger value for 7 indicates a predominance 
of the isocurvature modes on the adiabatic one. 



The result is shown in the left panel of Fig. 7.2 where the likelihood contours in the (ub,h) 
plane are indicated for purely adiabatic and for mixed (general isocurvature) models. It 
is remarkable the extent to which the innermost la contour opens up, once we allow for 
isocurvature components. Strangely, the least likely region is the upper left corner which 
contains the value of u>b = 0.019 ± 0.02 inferred from BBN (Buries et al., 2001) and the 
Hubble space telescope key project value for the Hubble parameter (Freedman et al., 2001) 
of h = 0.72 ± 0.08. Moreover, there is absolutely no upper limit for ui^ within the range 
investigated here! This is explained by the fact that the strongest features of a high baryon 
density universe, the asymmetry between even and odd acoustic peaks and the shift of the 
peak position due to the change in the sound velocity, can be fully compensated by an 
admixture of isocurvature modes (see left panel of Fig. 7.1). A very high baryon density can 
therefore easily be accommodated into this framework. However, for high lu^ and low h, it 
is difficult to find a good fit because there is not enough power in the secondary peak region 
due to the early integrated Sachs- Wolfe effect boosting the first peak. 
We define the isocurvature content of a mixed model as 

M 22 + M 33 + M 44 
7 55 t^M ' (7 - 5) 

where M\\ denotes the adiabatic mode amplitude. The isocurvature content of the model 
shown in the left panel of Fig. 7.1 is only 7 = 0.12, while for the parameter choice in the right 
panel one has 7 = 0.69. Hence, if the cosmological parameters are close to those chosen in 
the left panel, we can conclude that the cosmic perturbations are predominantly adiabatic. 

In the right panel of Fig. 7.2 we plot the isocurvature content, 7, of the best fit model 
obtained by minimizing x 2 by variation of the initial conditions for given values of the cos- 
mological parameters. Clearly, the further away we move from the region of parameter space 
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well fitted by the purely adiabatic model, the higher the isocurvature contribution needed to 
fit the data becomes. 

The main non-adiabatic component of our best fits is the neutrino entropy mode. This 
was to be expected, since this mode and its correlator with the adiabatic mode can shift the 
peak positions and can substantially add or subtract power from the second peak (Bucher 
et al., 2000). A crucial point is, therefore, to know whether such a mode can appear in 
a realistic structure formation scenario. It is known that for interacting species the non- 
adiabatic part of the perturbations tends to decay with time. Therefore, the generation of a 
neutrino entropy component can only occur after neutrino decoupling, that is at T < 1 MeV 
(see Gordon k, Malik, 2004 for a discussion). A neutrino isocurvature perturbation could also 
be due to a fourth species of sterile neutrinos which may have decoupled very early in the 
history of the Universe. The same remark also applies of course to the CDM isocurvature 
mode. Note that the energy density of this fourth neutrino type cannot be very high, in order 
not to contradict the light element abundances, but there is nothing which prevents (at least 
in principle) the presence of large perturbations in this component. 

7.2.2 How important is the assumption of adiabaticity? 

We have shown that in allowing for isocurvature perturbations, one can fit very well pre- 
WMAP CMB data with cosmological parameters which differ considerably from the ones 
preferred by adiabatic perturbations alone. More importantly, allowing for generic initial 
conditions, the ranges of cosmological parameters which can fit the CMB anisotropy data 
open up to an extent to become nearly meaningless. On the other hand, assuming measure- 
ments of cosmological parameters from other methods like direct measurements of the Hubble 
parameter which yield h ~ 0.65 and BBN which implies uj\> ~ 0.02, we can use the CMB to 
limit the isocurvature contribution in the initial conditions (or other unconventional features) 
and thereby learn something about the very early universe, i.e., the inflationary phase which 
has generated these initial conditions. For cosmological parameters in the range preferred 
by other CMB independent measurements (Ha ~ 0.7, f2 m ~ 0.3, h ~ 0.65, ~ 0.02) the 
isocurvature contribution in the initial conditions has to be relatively modest (7 < 0.3). We 
have also checked explicitly that, given these cosmological parameters, a purely isocurvature 
model, i.e. one with M\\ = 0, cannot fit the data. 

Finally, and most importantly, our work shows the danger of calling parameter estimation 
by CMB anisotropy experiments a "parameter measurement" since the results depend so sen- 
sitively (and quite unexpectedly) on the underlying model assumptions. We rather consider 
CMB anisotropies as an excellent tool to test model assumptions or consistency. In the light 
of these findings, non-CMB measurements of cosmological parameters acquire even more im- 
portance. In short, CMB is the ideal tool to investigate the primordial parameters for cosmic 
structure formation (i.e. the initial conditions), while there are many other possibilities to 
constrain cosmological parameters (Clx, h, etc), which we have to use in order to obtain good 
limits for possible isocurvature perturbations. 

As shown in Bucher et al. (2001) and discussed in § 7.4, CMB temperature anisotropies 
alone, even if measured with optimal precision limited by cosmic variance, do not allow the 
degeneracy between cosmological parameters and initial conditions to be removed. Polariza- 
tion measurements represent an additional non-trivial means to lift this degeneracy and might 
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constrain the contribution of the isocurvature modes to about 10% accuracy (Bucher et al., 
2001). The main reason for this is that polarization is mostly sensitive to the quadrupole 
of the photon distribution rather than the photon density perturbation, these two quantities 
depending in a different way on the initial conditions. In the same vein, using the normal- 
ization of the matter power spectrum (provided it can be measured accurately) also helps to 
break some of the degeneracies induced by the isocurvature modes, as we show in the next 
section. 
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7.3 The cosmological constant problem 

Ever since the beginning of modern cosmology, one of the most enigmatic ingredients has 
been the cosmological constant. Einstein (1917) introduced it to find static cosmological 
solutions (which are, however, unstable). Later, when the expansion of the Universe had 
been established, he reportedly called it his "greatest blunder". In relativistic quantum field 
theory, for symmetry reasons the vacuum energy momentum tensor is of the form eg^ u for 
some constant energy density e. The quantity A = SirGe can be interpreted as a cosmological 
constant. Typical values of e expected from particle physics come, for example, from the 
super-symmetry breaking scale which is expected to be of the order of e > 1 TeV 4 leading to 
A > 1.7 x 10 -26 GeV 2 , and corresponding to Oa ^ 10 58 . Recall that for the density parameter 

= e/pcrit = A/(87rGp cr it), where p CI a = 8.1 x 10~ 47 h 2 GeV 4 is the critical density and the 
fudge factor h is defined by Hq = 100 h km s _1 Mpc -1 , lying in the interval 0.5 < h < 0.8. 
Hq is the Hubble parameter today. 

Such a result is clearly in contradiction with kinematical observations of the expansion 
of the universe, which tell us that the value of O to t, the density parameter for the total 
matter-energy content of the universe, is of the order of unity, O(0 to t) ~ 1- For a long time, 
this apparent contradiction has been accepted by most cosmologists and particle physicists, 
convinced that there must be some deep, not yet understood reason that vacuum energy 
— which is not felt by gauge-interactions — does not affect the gravitational field either, 
and hence we measure effectively A = 0. This slightly unsatisfactory situation became 
really disturbing in 1998, as two groups, which had measured luminosity distances to type 
la supernovae, independently announced that the expansion of the universe is accelerated 
in the way expected in a universe dominated by a cosmological constant (Riess et al., 1998; 
Perlmutter et al., 1999). More recent measurements, which extend to higher redshift, seem 
to strengthen this conclusion (Tonry et al., 2003; Riess et al., 2004), obtaining values of the 
order 0{Vt m ) ~ O(^a) ~ 1 and cannot be explained by any sensible high energy physics 
model. Tracking scalar fields or quintessence (Ratra & Peebles, 1988; Wetterich, 1988) and 
other similar ideas (Ferreira &z Joyce, 1997) have been introduced in order to mitigate the 
smallness problem — i.e., the fact that e ~ 10~ 46 GeV 4 . However, none of those is completely 
successful and really convincing at the moment, see Straumann (2003); Sahni (2004) for 
reviews. 

7.3.1 Does structure formation need a cosmological constant? 

After the supernovae la results, cosmologists have found many other data-sets which also 
require a non- vanishing cosmological constant. The most prominent fact is that CMB 
anisotropics indicate a flat universe, Qtot = + = 1, while measurements of clus- 
tering of matter, e.g., the galaxy power spectrum, require T = hVL m ~ 0.2. But also CMB 
data alone, with some reasonable prior on the Hubble parameter, point to JIa > at high 
significance (Spergel et al., 2003). 

This cosmological constant problem is probably the greatest enigma in present cosmology. 
The supernova results are therefore under detailed scrutiny, and there has been a significant 
amount of work aiming at finding an alternative explanation for the data, see e.g. Meszaros 
(2002); Blanchard et al. (2003); Alam et al. (2004). Cosmological observations are usually 
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very sensitive to systematic errors which are often very difficult to discover. Therefore, in 
cosmology an observational result is usually accepted by the scientific community only if 
several independent data-sets lead to the same conclusion. But this seems to be exactly the 
case for the cosmological constant. 

It is therefore imperative to investigate in detail whether present structure formation data 
does require a cosmological constant, by asking whether enlarging the space of models for 
structure formation does mitigate the cosmological constant problem. There are several ways 
to enlarge the model space, e.g. one may allow for features in the primordial power spectrum, 
like a kink (Barriga et al., 2001). Here we study the cosmological constant problem in relation 
to the initial conditions for the cosmological perturbations. 

In a first step we discuss once more the usual results obtained assuming purely adiabatic 
models and we investigate the extent to which pre-WMAP CMB data alone or combined with 
large-scale structure measurements require Q\ / in a flat universe, presenting the findings 
published in Trotta et al. (2003). We shall first proceed with the usual Bayesian analysis, 
but we also discuss the results which are obtained in a frequentist approach. We find that 
even if Q\ = is outside the high likelihood region in a Bayesian approach this is no longer 
the case from the frequentist point of view. In other words the probability that a model with 
vanishing f^A leads to the present-day observed CMB and large-scale structure data is not 
exceedingly small. 

We then study how the results are modified if we allow for general isocurvature contribu- 
tions to the initial conditions. In this first study of the matter power spectrum from general 
isocurvature modes we discover that a COBE-normalized matter power spectrum reproduces 
the observed amplitude only if it is highly dominated by the adiabatic component. Hence the 
isocurvature modes cannot contribute significantly to the matter power spectrum and do not 
lead to a degeneracy in the initial conditions for the matter power spectrum when combined 
with CMB data. 

7.3.2 CMB and large scale structure data analysis 

The pre-WMAP CMB measurements, from BOOMERanG (Netterfield et al., 2002), MAX- 
IMA (Lee et al., 2001), DASI (Halverson et al., 2002), VSA (Scott et al., 2003; Taylor et al., 
2003), CBI (Pearson et al., 2003) and Archeops (Benoit et al., 2003a) are in very good agree- 
ment up to the third peak in the angular temperature power spectrum of CMB anisotropies, 
I ~ 1000. In our analysis we therefore use the COBE data (Smoot et al., 1992; Bennett et al., 
1994) in the decorrelated compilation of Tegmark h Hamilton (1997) (7 points excluding the 
quadrupole) for the I region 3 < I < 20 and the BOOMERanG data to cover the higher 
£ part of the spectrum (19 points in the range 100 < I < 1000). Since Archeops has the 
smallest error bars in the region of the first acoustic peak, we also include this data-set (16 
points in the range 15 < £ < 350). Including any of the other mentioned data does not influ- 
ence our results significantly. The BOOMERanG and Archeops absolute calibration errors 
(10% and 7% at la, respectively) as well as the uncertainty of the BOOMERanG beam size 
are included as additional Gaussian nuisance parameters, and are maximized over. We make 
use of the Archeops window functions available from the Archeops Website (2003), while 
for BOOMERanG a top-hat window is assumed. For the matter power spectrum, we use 
the galaxy-galaxy power spectrum from the 2dF data which is obtained from the redshift of 
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about 10 5 galaxies (Tegmark et al., 2002). We include only the 22 decorrelated points in the 
linear regime, i.e., in the range 0.017 < k < 0.314 [h Mpc -1 ], and the window functions of 
Tegmark et al. (2002) which can be found at Tegmark's Website (2003). 

Our grid of models is restricted to flat universes and we assume purely scalar perturbations. 
Since the goal here is more to make a conceptual point than to consider the most generic 
model, we fix the baryon density to the BBN preferred value fl^h 2 = ujy, = 0.020 (Buries 
et al., 2001) and we investigate the following 3-dimensional grid in the space of cosmological 
parameters: 

0.35 < h < 1.00 (0.025) , 

0.00 < A < 0.95 (0.05) , (7.6) 
0.80 < n s < 1.20 (0.05) , 

where n s is the scalar spectral index, which again we take to be the same for all modes, and the 
numbers in parenthesis give the step size we use. The total matter content O m = Q c d m + 
is f2 m = 1 — Qji, and f2 c dm indicates the cold dark matter contribution. For all models the 
optical depth of reionization is r = and we have three families of massless neutrinos. For 
each grid point we compute the ten CMB and matter power spectra, one for each independent 
set of initial conditions, as explained in § 4.3. The initial condition correlation matrix M is 
parameterized using the ten dimensional hypercube parameters presented on page 96. 

For a given initial conditions correlation matrix M and spectral index n s , we quantify 
the isocurvature contribution to the CMB temperature anisotropy by the phenomenological 
parameter (3 defined as 

£ <(/(* + l))Cf" x) )< 
x^ci™ 

£ <^+i)crv 

y=AD,CI,NV,ND 

where the average (•) is taken in the t range of interest, in our case 3 < t < 1000, and 
where C^ x ' x) stands for the auto-correlator of the CMB anisotropies with initial conditions 
X. This quantity measures the average power of the adiabatic and isocurvature modes over 
the full multipole range, and therefore it gives a more phenomenological description of the 
isocurvature contribution than the parameter 7 used in the previous section, and defined in 
Eq. (7.5, page 177). 

As highlighted in § 5.1.5, the correct interpretation of Bayesian statistics is in terms of 
most likely regions in parameter space, while the frequentist approach is required in order to 
obtain exclusion intervals for the parameters. In order to answer the question of whether the 
CMB and large scale structure data exclude with a given confidence the value Oa = 0, we 
use the frequentist statistics, and compare the result with the usual Bayesian approach. 

7.3.3 Adiabatic perturbations 

We first fit CMB data only (N = 42) by maximizing M = 7 parameters, i.e., the three 
nuisance parameters, n s , h, f?A and the overall amplitude of the adiabatic spectrum, and we 
find (Bayesian likelihood intervals on $7 a alone): 

n A = 0.80±g;^ at 2a and torn at 3a. (7.8) 
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Figure 7.3: Joint likelihood contours (Bayesian, left panel) and confidence contours (frequen- 
tist, right panel), with CMB only (solid lines, la, 2a, 3a contours) and CMB+2dF (filled) 
for purely adiabatic initial conditions. In the right panel, the number of effective degrees of 
freedom is F cS = 31 for CMB alone F cff = 50 for CMB+2dF. 

The asymmetry in the intervals arises because the value of Oa for our maximum likelihood 
(ML) model is relatively large. One could achieve a better precision in determining the ML 
value of by using a finer grid and varying oj^ as well, which has extensively been done in 
the literature and is not the scope of this work. Moreover, the position of the acoustic peaks 
in CMB anisotropies is mostly sensitive to the age of the universe at recombination, which 
depends only on fl m h 2 , and to the angular diameter distance, which depends on O m , Q\ 
and the curvature of the universe. When the universe is flat, the angular diameter distance 
is weakly dependent on the relative amounts of f2 m and ft\ as long as Oa is not too large, 
see § 4.1.2 and Fig. 4.1 on page 84. Hence, one can achieve a sufficiently low value of Q, m h 2 
either via a large cosmological constant or via a very low Hubble parameter, h<0A5. 

We now include the matter power spectrum P m , assuming P m = b 2 P g , where P g is the 
observed galaxy power spectrum and b some unknown bias factor (assumed to be scale in- 
dependent), over which we maximize. Inclusion of this data in the analysis breaks the 
h degeneracy, since P m is mainly sensitive to the shape parameter T = Cl m h. We therefore 
obtain significantly tighter overall likelihood intervals for 

n A = 0.70j^;^ at 2a and toir at 3a. (7.9) 

We plot joint likelihood contours (Bayesian) for Q\, h with purely adiabatic initial conditions 
in the left panel of Fig. 7.3. From the Bayesian analysis, one concludes that CMB and 2dF 
together require a non-zero cosmological constant at very high significance, more than 7a 
for the points in our grid! Note that the ML point has a reduced chi-square Xf =5 q = 0.59, 
significantly less than unity. 

The frequentist analysis, however, excludes a much smaller region of parameter space, cf. 
the right panel of Fig. 7.3. The frequentist contours must be drawn for the effective number 
of degrees of freedom, i.e., using the number of effectively independent data points. We can 
therefore roughly take into account a 10% correlation, which is the maximum correlation 
between data points given in Netterfield et al. (2002); Benoit et al. (2003a), by replacing F 
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by the effective number of degrees of freedom, F c g = 0.9N — M, and rounding to the next 
larger integer (to be conservative). One could argue that the BOOMERanG and Archeops 
data points are not completely independent, since BOOMERanG observed a portion of the 
same sky patch as measured by Archeops. This possible correlation is difficult to quantify, 
but should not be too important since the sky portion observed by Archeops is a factor of 
10 larger than BOOMERanG's and therefore we ignore it here. The right panel of Fig. 7.3 
is drawn with F e R = 31 for CMB alone and F c s = 50 for CMB+2dF, but we have checked 
that our results do not change much if we use a 5% correlation. 

It is interesting to note that there are regions in the left panel which are excluded with 
a certain confidence by CMB data alone but are no longer excluded at the same confidence 
when we include the 2dF data. In other words, it would seem that taking into account more 
data and therefore more knowledge about the universe, does not systematically exclude more 
models, i.e., the CMB+2dF contours are not always contained in the CMB alone contours. 
This apparent contradiction vanishes when one realizes that the confidence limits on, e.g., 
f&A alone in the frequentist approach are just the projection of the confidence contours of 
the right panel on the Q\ axis. One can readily verify in the right panel that the confidence 
limits for the combined data-set are always smaller than the ones for CMB data alone. There 
are points with Oa = and h ~ 0.40 which are still compatible within 2a with both 2dF 
and CMB data, at the price of pushing somewhat the other parameters. In the best fit with 
f&A = shown in Fig. 7.4, one has to live with a red spectral index n s = 0.80. Furthermore, 
the calibration of the BOOMERanG and Archeops data points is reduced in this fit by 34% 
and 26%, respectively, i.e., more than 3 times the quoted la systematic error. 

In both cases, it is clear that one can exploit the J7a, h degeneracy to fit CMB data 
alone with a model having Q\ = 0. For a flat universe like the one we are considering, 
one has then to use a much smaller value of the Hubble parameter than the one indicated 
by other measurements, most notably the HST Key Project (Freedman et al., 2001), which 
gives h = 0.72 ± 0.08. The 2dF data are mainly sensitive to the shape parameter T ~ 0.2, 
hence 2dF with O m = 1.0 would require an even lower value of h which is not compatible with 
CMB. Therefore inclusion of 2dF data tends to exclude any flat model without a cosmological 
constant. Summing up, for purely adiabatic initial conditions the Bayesian approach gives 
very strong support to Oa / 0; in the more conservative frequentist point of view, while Qa ^ 
cannot be excluded with very high confidence, the combination of 2dF and pre-WMAP 
CMB data start to be incompatible with a flat universe with vanishing cosmological constant. 
These conclusions are in qualitative agreement with previous works using comparable data 
(Netterfield et al., 2002; Pryke et al., 2002; Lewis & Bridle, 2002; Wang et al, 2002; Durrer 
et al., 2003b; Rubino-Martin et al., 2003; Benoit et al., 2003b). In the next section we 
investigate the stability of these well known results with respect to inclusion of non-adiabatic 
initial conditions. 

7.3.4 Mixed adiabatic and isocurvature perturbations 

We now enlarge the space of models by including all possible isocurvature modes with ar- 
bitrary correlations among themselves and the adiabatic mode as described in the previous 
section, but with the restriction that all modes have the same spectral index. We first consider 
CMB data only and maximize over initial conditions. The number of parameters increases 
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Figure 7.4: Best fit with S7a = and purely adiabatic initial conditions, compatible with 
CMB and 2dF data within 2a confidence level (frequentist). In the right panel, only the 2dF 
data points left of the vertical, dotted line — i.e., in the linear region — have been included 
in the analysis. Note the low CMB first acoustic peak in the left panel due to the joint effect 
of the red spectral index and of the absence of early ISW effect. In this fit, the calibration of 
BOOMERanG (red/dark gray errorbars) and Archeops (green/light gray errorbars) has been 
reduced by 34% and 26%, respectively. To appreciate the difference, we plot the non recal- 
ibrated value of the BOOMERanG and Archeops data points as diagonal/magenta crosses 
and vertical/light blue crosses, respectively. Even though the fit is "by eye" very good, it 
seems highly unlikely that the calibration error is so large. 



by nine and the number of degrees of freedom decreases correspondingly with respect to the 
purely adiabatic case considered above. 

Likelihood (Bayesian, left panel of Fig. 7.5) and confidence (frequentist, right panel of 
Fig. 7.5) contours widen up somewhat along the degeneracy line. The enlargement is less 
dramatic than in the case of the baryon density presented in § 7.2. This is partially due to 
our prior of flatness which reduces the space of models to those which are almost degenerate 
in the angular diameter distance. Most of our models have the first acoustic peak of the 
adiabatic mode already in the region preferred by experiments, hence in most of the fits, 
isocurvature modes play a modest role, especially in the parameter regions with large f^A, 
h (cf. Fig. 7.9 and the discussion below). Nevertheless, because of the Q\, h degeneracy, 
even a modest widening of the contours along the degeneracy line results in an important 
enlargement of the likelihood limits. The ML point does not depart very much from the 
purely adiabatic case, but now we cannot constrain Q\ at more than Iff (Bayesian, CMB 
only): 

n A = 0.85±° ° 3 l at lo- , (7.10) 

and no limits for 0.0 < Ha < 0.95 at higher confidence. 

In Fig. 7.6 we plot the dark matter power spectra of the different auto- (left panel) and 
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Figure 7.5: Joint likelihood contours (Bayesian, left panel) and confidence contours (frequen- 
tist, right panel), with CMB only (solid lines) and CMB+2dF (filled) after maximization over 
general isocurvature initial conditions. The likelihood/probability content is la, 2a, 3a, from 
the center to the outside. The disconnected la region in the left panel is an artificial feature 
due to the grid resolution. In the right panel, the number of effective degrees of freedom is 
F cS = 22 for CMB alone F cff = 41 for CMB+2dF. 

cross-correlators (right panel) for a concordance model. The norm of each pure mode (AD, CI, 
ND, NV) is chosen such that the corresponding CMB power spectrum is COBE-normalized. 
The cross-correlators are normalized according to totally correlated spectra, i.e. 

M ix ,Y) = v^MxMy/2 , (7.11) 

where M( X ,y) denotes the norm of the cross-correlator between the modes X,Y and Mx 
the norm of the pure mode X. A crucial result is that the COBE-normalized amplitude 
of the adiabatic matter power spectrum is nearly two orders of magnitude larger than the 
isocurvature contribution. The main reason for this is the amplitude of the Sachs- Wolfe 
plateau which is about ^$ for adiabatic perturbations and 2$ for isocurvature perturbations, 
where <3? is the gravitational potential at last scattering, see Eq. (4.3) and Eq. (4.4, page 79). 
This difference of a factor of about 36 in the power spectrum on large scales is clearly visible 
in the comparison of Pad and Pci (the difference increases at smaller scales). The case of 
the neutrino modes is even worse since they start with vanishing dark matter perturbations. 
That the CDM isocurvature matter power spectrum is much lower than the adiabatic one 
has been known for some time (see e.g. Stompor et al., 1996; Pierpaoli et al., 1999). However, 
it was not recognized before that the same holds true for the neutrino isocurvature matter 
power spectra as well, and - more importantly - that this leads to a way to break the strong 
degeneracy among initial conditions which is present in the CMB power spectrum alone. 

In an analysis with general initial conditions including the 2dF data only we obtain very 
broad likelihood and confidence contours which exclude only the lower right corner of the 
(f^A, h) plane. In contrast to the CMB power spectrum, the matter power spectrum can be 
fitted with extremely high adiabatic and isocurvature contributions, which are then typically 
cancelled by large anti-correlations between the spectra. This behavior is exemplified for 
a model with general isocurvature initial conditions and Oa = 0.70, h = 0.65, n s = 1.0 



7.3 The cosmological constant problem 



187 




10- 4 10- 3 10- 2 10-' 10° 10' 10- 4 10" 3 10" 2 10"' 10° 10 1 

k (h Mpc-') k (h Mpc" 1 ) 



Figure 7.6: Dark matter power spectra of the different auto- (left panel) and cross-correlators 
(right panel) for a concordance model with Oa = 0.70, h = 0.65, n s = 1.0, u>b = 0.020, 
with the corresponding CMB power spectrum COBE-normalized. The color and line style 
codes are as follows: in the left panel, adiabatic (AD): solid/black line; CDM isocurva- 
ture (CI): dotted/green line; neutrino density (ND): short-dashed/red line; neutrino ve- 
locity (NV): long-dashed/blue line; in the right panel, AD: solid/black line (for compari- 
son), (AD, CI): long-dashed/magenta line, (AD,ND): dotted/green line, (AD,NV): short- 
dashed/red line, (CI, ND): dot-short dashed/blue line, (CI, NV): dot-long dashed/light-blue 
line, and (ND,NV): dot-short dashed/black line. The adiabatic mode is by far dominant 
over all others. 



in Fig. 7.7. The best fits with 2dF data only are dominated by large isocurvature cross- 
correlations. Clearly, the resulting CMB power spectrum is highly inconsistent with the 
COBE data. Hence such "bizarre" possibilities are immediately ruled out once we include 
CMB data. Conversely, moderate isocurvature contributions can help fitting the CMB data, 
and do not influence the matter power spectrum, which is completely dominated by the 
adiabatic mode alone. 

Combining CMB and 2dF data we find now (Bayesian, mixed isocurvature models): 

n A = 0.65±g;|§ at 2a and ±g;|| at 3a. (7.12) 

The likelihood limits are larger than for the purely adiabatic case but it is interesting that the 
Bayesian analysis still excludes 0,\ = at more than 3a even with general initial conditions, 
for the class of models considered here. Because of the above explained reason, the widening 
of the limits is not as drastic as one might fear. Therefore, combination of CMB and LSS 
measurements turn out to be an ideal tool to constrain the isocurvature contribution to the 
initial conditions. 

From the frequentist point of view, one notices that the region in the Q\,h plane which 
is incompatible with data at more than 3a is nearly independent on the choice of initial 
conditions (compare the right panels of Fig. 7.3 and Fig. 7.5). Enlarging the space of initial 
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Figure 7.7: Concordance model fit with general isocurvature initial conditions and 2dF data 
only. The total spectrum (solid/black) is the result of a large cancellation of the purely 
adiabatic part (long-dashed/red) by the large, negative sum of the various correlators (dot- 
ted/magenta, plotted in absolute value). The short-dashed/green curve is the sum of the 
three pure isocurvature modes, CI, ND and NV. Note that the resulting total spectrum is 
less than one tenth of the purely adiabatic part. 

conditions seemingly does not have a relevant benefit on fitting CMB and 2dF data with or 
without a cosmological constant. The reason for this is that the (COBE-normalized) matter 
power spectrum is dominated by its adiabatic component and therefore the requirement 
£l m h ~ 0.2 remains valid. In Fig. 7.8 we plot the best fit model with general initial conditions 
and Q,\ = 0. We summarize our likelihood and confidence intervals on JIa (this parameter 
only) in Table 7.1. 

In Fig. 7.9 we plot the isocurvature contribution to the best fit models with CMB and 2dF 
in terms of the parameter defined in (7.7). The best fit with Oa = has an isocurvature 
contribution of about 40%. We can put a constraint on the maximal isocurvature contribution 
allowed by combining this plot with the exclusion plot obtained with the frequentist approach, 
Fig. 7.5 right panel. The result is that frequentist statistics limits the isocurvature content 
(3 to be 

/3<0.4 (2ffd.). (7.13) 



7.3.5 Do isocurvature perturbations mitigate the A problem? 

There are three main conclusions we can draw from these results. The first one is not new, but 
seems to be dangerously forgotten in recent cosmological parameters estimation literature: 
namely that likelihood contours cannot be used as "exclusion plots". The latter are usually 
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Figure 7.8: Best fit with general isocurvature models and Oa = 0. As for the purely adiabatic 
case, even with general initial conditions the absence of the cosmological constant suppresses 
in an important way the height of the first peak. In both panels we plot the best total spec- 
trum (solid/black), the purely adiabatic contribution (long-dashed/red), the sum of the pure 
isocurvature modes (short-dashed/green) and the sum of the correlators (dotted/magenta, 
multiplied by —1 in the left panel and in absolute value in the right panel). The matter 
power spectrum is completely dominated by the adiabatic mode, while the correlators play 
an important role in cancelling unwanted contributions in the CMB power spectrum at the 
level of the first peak and especially in the COBE region. For this model we have an isocur- 
vature content j3 = 0.39, while the BOOMERanG and Archeops calibrations are reduced by 
28% and 12%, respectively. The color codes for the error-bars are the same as in Fig. 7.4. 



substantially wider, less stringent. A more rigorous possibility are frequentist probabilities, 
which however suffer from the dependence on the number of really independent measurements 
which is often very difficult to come by. 

Secondly, we have found that in COBE-normalized fluctuations, the matter power spec- 
trum has negligible isocurvature contributions and is essentially given by the adiabatic mode. 
Hence the shape of the observed matter power spectrum still requires Q m h ~ 0.2, indepen- 
dent of the choice of initial conditions. Due to this behavior, the condition = Oa + = 1 
requires either a cosmological constant or a very small value for the Hubble parameter, inde- 
pendently from the isocurvature contribution to the initial conditions. 

The third conclusion concerns the presence of a cosmological constant from pre-WMAP 
CMB data combined with the 2dF matter power spectrum: For flat models, a likelihood 
(Bayesian) analysis strongly favors a non- vanishing cosmological constant. Even if we allow 
for isocurvature contributions with arbitrary correlations, a vanishing cosmological constant 
is still outside the 3<r likelihood range. It is possible that there are open models, which we did 
not consider here, in which the NV mode would be dominant,: this because it presents a first 
acoustic peak at I = 170 in flat models, which would be displaced to a larger multipole value, 
as preferred by data, in an open Universe, thereby possibly giving a good fit to CMB data 
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Figure 7.9: Isocurvature content 0.0 < (3 < 1.0 of best fit models with CMB and 2dF data. 
The contours are for (3 = 0.20, 0.40, 0.60, 0.80 from the center to the outside. 



Purely adiabatic 


Data-sets 


n A 


Bayesian 1 
la 2a 3a 


Frequentist 2 
la 2a 3a 


F X 2 /F 


CMB 

CMB +2dF 


0.80 
0.70 


+0.08 +0.10 +0.12 
-0.08 -0.35 

+0.05 +0.13 +0.15 
-0.05 -0.17 -0.27 


< 0.93 

fi A >a?5 < 0.92 < 0.92 


35 0.58 
56 0.59 


General isocurvature 


CMB 

CMB+2dF 


0.85 
0.65 


+0.05 
-0.35 

+0.15 +0.22 +0.25 
-0.10 -0.25 -0.48 


< 0.90 < 0.92 < 0.95 


26 0.74 
47 0.67 



Likelihood interval. 
2 Region not excluded by data with given confidence. 



Table 7.1: Likelihood (Bayesian) and confidence (frequentist) intervals for Q\ alone (all other 
parameters maximized). A bar, — , indicates that at the given likelihood/confidence level the 
analysis cannot constraint Oa in the range 0.0 < fl\ < 0.95. Where the quoted interval is 
smaller than our grid resolution, an interpolation between models has been used. 

and allow for the observed shape parameter T with a reasonable value of h. This question 
remains to be investigated in detail. 

The situation changes considerably in the frequentist approach. There, even for purely 
adiabatic models, = is still within 3a for a value of h < 0.48 which is marginally 
defendable. The conclusion does not change very much when we allow for generic initial 
conditions. 
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7.4 Precision cosmology independent of initial conditions 

As we have seen, it is difficult to simultaneously constrain both the type of initial conditions 
and the cosmological parameters using CMB alone. The future high accuracy measurements 
of CMB polarization will help substantially in breaking degeneracies between initial condi- 
tions. The degeneracies in the parameter dependence of temperature and polarization are 
almost orthogonal, and polarization can therefore lift "flat directions" in parameter space. 

To determine cosmological parameters independently on the initial conditions, one in- 
cludes general isocurvature modes, and then marginalize over them. Bucher et al. (2002, 
2001) considered forecasts for WMAP and Planck, and found that admitting isocurvature 
modes would ruin the ability of WMAP to determine the cosmological parameters with tem- 
perature information only. They also highlighted that polarization measurements would be 
decisive in assisting into the reconstruction of the cosmological parameters when allowing 
for general isocurvature initial conditions. Their results were obtained with a Fisher matrix 
analysis on a cosmological parameter set which, according to Kosowsky et al. (2002), leads to 
large overestimates of the expected errors. We have reproduced their study (Trotta & Durrer, 
2004), using for the Fisher matrix forecast the normal parameter set described in § 4.2 so 
that we obtain forecasts not for the highly degenerate directions defined by the cosmological 
parameters, but rather for orthogonal combinations which are well measured by the CMB. 
Along these directions, forecasts are much more reliable. The main features are summarized 
in Fig. 7.10, where we plot the expected la error in percent for the six quantities which are 
directly probed by the CMB with good accuracy (see figure caption). We omit the energy 
density in the cosmological constant, which is ill-determined with CMB alone because of the 




Figure 7.10: Fisher matrix forecast for the percent la errors on six quantities which are 
well determined by CMB alone with and without inclusion of general isocurvature initial 
conditions. The left (right) panel is a forecast for WMAP four year mission (Planck). From 
left to right, on the abscissa axis: the baryon density, uif,, the angular diameter distance 
Da, the redshift of matter-radiation equality z eq , the scalar spectral index n s , the scalar 
adiabatic amplitude an d a function of the optical depth to reionization, r re . In the 
legend, "AD" means that only adiabatic fluctuations were included, "iso" means that general 
isocurvature modes were included and marginalized over. "TT" uses temperature information 
alone, "T+P" has temperature, E-T correlation and E-polarization. 
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geometrical degeneracy. We do not restrict our analysis to flat models, but include spaces 
with non-zero curvature. 

For WMAP the errors on normal parameters will increase roughly by a factor ten with 
respect to the purely adiabatic scenario if one marginalizes over general initial conditions, 
when temperature information alone is considered (cf. first and third bar in the left panel). 
When the full polarization information is included, however, the errors will still be within 
approximately 10 to 30% even in the general isocurvature scenario. From the right panel, we 
deduce that for the Planck experiment the worsening of the errors will be much less if the high 
quality polarization information is included. Roughly speaking, by including isocurvature 
modes we expect errors which are larger than in the adiabatic case by about a factor of two, 
but mostly still within the few percent accuracy. These findings are in qualitative agreement 
with Bucher et al. (2001), while providing a quantitatively more reliable estimate of the 
expected accuracy. 

This shows that the CMB alone will be able to provide high precision cosmology even if 
the strong assumption of purely adiabatic initial conditions will be relaxed. Combining CMB 
results with other observation which independently constrain the cosmological parameters, 
will enable us to fully open this window to the mysterious epoch of the very early universe. 
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